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There has recently been considerable interest in the stability of 
different fair bandwidth sharing policies for models that arise in the 
context of Internet congestion control. Here, we consider a connec- 
tion level model, introduced by Massoulie and Roberts [Telecommu- 
nication Systems 15 (2000) 185-201], that represents the randomly 
varying number of flows present in a network. The weighted a-fair 
and weighted max-min fair bandwidth sharing policies are among 
important policies that have been studied for this model. Stability 
results are known in both cases when the interarrival times and ser- 
vice times are exponentially distributed. Partial results for general 
service times are known for weighted a-fair policies; no such results 
are known for weighted max-min fair policies. Here, we show that 
weighted max-min fair policies are stable for subcritical networks 
with general interarrival and service distributions, provided the lat- 
ter have 2 + <5i moments for some Si > 0. Our argument employs an 
appropriate Lyapunov function for the weighted max-min fair policy. 

1. Introduction. We consider a connection level model for Internet con- 
gestion control that was first studied by Massoulie and Roberts [9]. This 
stochastic model represents the randomly varying number of flows in a net- 
work for which bandwidth is dynamically shared among flows that corre- 
spond to the transfer of documents along specified routes. Standard band- 
width sharing policies are the weighted a-fair, a € (0,oo), and the weighted 
max-min fair policies. An important example of the former is the propor- 
tionally fair policy, which corresponds to a = 1. The weighted max-min 
fair policy corresponds to a = oo. These policies allocate service uniformly 
to documents along a given route, and allocate service amongst different 
routes in a "fair" manner. A question of considerable interest is when such 
policies are stable. 
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De Veciana, Lee and Konstantopoulos [5] studied the stability of weighted 
max-min fair and proportionally fair policies; Bonald and Massoulie [2] stud- 
ied the stability of weighted a-fair policies. Both papers assumed exponen- 
tially distributed interarrival and service times for documents. The first con- 
dition is equivalent to Poisson arrivals, and the second condition corresponds 
to exponentially distributed document sizes with documents processed at a 
constant rate. Both papers constructed Lyapunov functions which imply the 
stability of such models when the models are subcritical, that is, the under- 
lying Markov process is positive Harris recurrent when the average load at 
each link is less than its capacity. 

Relatively little is currently known regarding the stability of subcritical 
networks with general interarrival and service times. Massoulie [8] showed 
stability for the proportionally fair policy for exponentially distributed in- 
terarrival times and general service times that are of phase type. A suitable 
Lyapunov function was employed to show stability. 

The stability problem for bandwidth sharing policies is in certain aspects 
similar to the analogous problem for multiclass queueing networks. A signif- 
icant complication that arises in the context of bandwidth sharing policies 
is the requirement of simultaneous service of documents at all links along 
a route. This can reduce the efficiency of service, and complicates analysis 
when the interarrival and service times are not exponentially distributed. 

When the interarrival and service times are exponentially distributed, 
finer results are possible. In Kang et al. [7], a diffusion approximation is 
established under weighted proportionally fair policies. There and in Gromoll 
and Williams [6] , summaries and a more detailed bibliography are provided 
for different bandwidth sharing policies, for both exponentially distributed 
and more general interarrival and service times. 

Here, we investigate the behavior of weighted max-min fair policies for 
subcritical networks whose interarrival and service times have general dis- 
tributions. We show that such networks are stable, provided that the service 
distributions have 2 + 5\ moments for some 5\ > 0. No conclusion is reached 
when fewer moments exist. As in previous papers on stability, we construct 
a suitable Lyapunov function. Because of the more general framework here, 
the Markov process underlying the model will now have a general state space, 
and will require the machinery associated with positive Harris recurrence. 

We next give a more detailed description of the model we consider, after 
which we state our main results. We then provide some basic motivation 
behind their proof together with a summary of the remainder of the paper. 

Description of the model. In the model we consider, documents are as- 
sumed to arrive at one of a finite number of routes r £7Z according to inde- 
pendent renewal processes, with interarrival times denoted by £ r (l), £ r (2), 
Here, £ r (l) are the initial residual interarrival times, and are considered part 
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of the initial state. The remaining variables £ r (2), 6- (3), ■ • ■ are assumed to be 
i.i.d. with mean l/u r , v r > 0, for each r, with the sequences being indepen- 
dent of one another; £ r will denote a random variable with the corresponding 
distribution. The service times of documents are assumed to be independent 
of one another and of the inter arrival times, and have distribution functions 
H r {-) with means m r < oo. The initial state will include the residual service 
times of documents initially in the network. 

On each route r, there are a finite number of links I, where service is 
allocated to the documents on the route. For the models considered in [2, 5, 8] 
and [9], documents on a route r receive service simultaneously at all links 
I on the route, with all such documents being allocated the same rate of 
service A r at all such links at a given time. Associated with such a network 
is an incidence matrix A = (Ai ;r ), I £ C, r G 1Z, with A^ r = 1 if link I lies on 
route r and Ai r = otherwise. When A[ r = A\y = 1, with r/r', the routes 
r and r' share a common link. 

Setting z r equal to the number of documents on route r, A r = X r z r denotes 
the rate of service allocated to the totality of all documents on the route. 
Each link I is assumed to have a given bandwidth capacity c\ > 0. A feasible 
policy requires that this capacity not be exceeded, namely 

(1.1) A l,r A r < Q for all I G C. 

Denoting by A = (A r ) and c = (q) the corresponding column vectors, this is 
equivalent to AA < c, with the inequality being interpreted coordinatewise. 

None of the results in this paper relies on the restriction that either A^ r = 
1 or A\ r = 0. Here, we continue to assume that (1.1) is satisfied, for given A, 
but with the weaker assumption Ai^ r > 0. Under this new setup, each link 
may be interpreted as belonging to every route. A given link I now allocates 
the same rate of service A r to each route r, which utilizes this service at 
rate A^ r . For Ai >r G [0, 1], Ai >r may be interpreted as the proportion of this 
potential service that is actually utilized at link I by route r. 

The traffic intensity p r = v T m r measures the average rate over time at 
which work enters a route r. We say a network is subcritical if 

(1.2) ^ Ai, r p r < ci for all I G C, 

or, in matrix form, Ap < c, where p = (p r ) is the corresponding column 
vector. This corresponds to the definition of subcriticality that is employed 
in the context of queueing networks, where the load at each station (here, 
load at each link) is strictly less than its capacity. Condition (1.2) is needed 
for stability. It is assumed in, for example, [2, 5] and [8]. 

The a-fair and max-min fair policies are examples of feasible policies for 
which the allocation of service to documents at a given time is determined 
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by the vector z = (z r ); the weighted a-fair and max-min fair policies are 
defined analogously, but with a weight w r > assigned to route r. We do not 
define a-fair here, or, in particular, proportionally fair, referring the reader 
to the previous references. Weighted max-min fair (WMMF) is defined as 
a feasible policy that, at each time, allocates service so that 



among nonempty routes 7Z' . That is, the minimum amount of weighted 
service each document receives is maximized, on r with z r > 0, subject to 
the constraint (1.1). 

As defined above, a WMMF policy always exists, although it need not 
be unique, since there may be some flexibility in allocating service among 
those routes where documents are receiving more than the minimal amount 
of service. Since our results apply to all such policies, we will not bother to 
select a "best" member that, for instance, maximizes service on the routes 
that are already receiving more than the minimum service. Such a "best" 
policy can be obtained by solving a hierarchy of optimization problems, 
as mentioned above display (2) in [5]. [By employing the convexity that is 
inherent in the constraint (1.1), it is routine to verify the existence of such 
policies.] 

Since the vector z of documents changes as time evolves, so will the al- 
location of service. From this point on, we reserve the notation X r (t) and 
A r (t) for the allocation of service for a WMMF policy at time t. We find it 
useful to also introduce 



with (1.3) in mind. Between arrivals and departures of documents, A(-) = 
(A r (-)) and X w (-) will be constant; we specify that they be right continuous 
with left limits. 

The state of the network evolves over time as documents arrive in the 
network, are served, and then depart. For networks with exponentially dis- 
tributed interarrival and service times and an assigned policy, z = (z r ) suf- 
fices to describe its state. As with queueing networks, one needs to specify 
the residual interarrival and service times in general. With this in mind, we 
employ the notation z r (B r ) to denote the number of documents on route 
r that have residual service times in B r C M + , and u r to denote the resid- 
ual interarrival time for r, with z(B) = (z r (B r )), B = (B r ), and u = (u r ) 
denoting the corresponding vectors. Setting 



(1.3) 



mmf\ r /w r } is maximized, 



(1.4) 



X w (t) 



min{A r (t) lw r 



(1.5) 




the state x contains this information. We will employ X(t),Z(t, •) and U(t) 
for the corresponding random states at time t. The natural metric space S 
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that corresponds to the states x is no longer discrete. We will describe S in 
more detail in Section 2. 

One can specify a Markov process X(-) on S that corresponds to the 
network with the assigned WMMF policy. The process X(-) is constructed 
in the same manner as is its analog for a queueing network. More detail 
is again given in Section 2. We note here that since S is not discrete, the 
notion of positive recurrence needs to be replaced by that of positive Harris 
recurrence. When X(-) is positive Harris recurrent, we will say that the 
network is stable. 

In order to demonstrate positive Harris recurrence for X(-), we will define, 
in Section 3, an appropriate nonnegative function, or norm, \\x\\, for x € S. 
It is defined in terms of the norms \x\l, \x\r and \x\a, by 

(1.6) = \x\l + \x\r + \x\a- 

Without going into detail here, we note that \x\l and \x\r are defined from 
z(-), where \x\l, in essence, measures residual service times smaller than 
N, for a given large N, \x\r measures residual service times greater than 
N, and \x\a is a function of the largest residual interarrival time. (When a 
distribution function H r has a thin enough tail, we actually replace N by a 
smaller value Nh t that depends on H r .) As one should expect, as either the 
total number of documents Yl r z r — > oo or \u\ — > oo, then ||x|| — > oo. 

Main results. We now state our two main results. 

Theorem 1.1. Suppose that a subcritical network with a weighted max- 
min fair policy has interarrival times with finite means and service times 
with 2 + 5\ moments, 6\ > 0. For the norm in (1.6), and appropriate N,L 
and £i > 0, 

(1.7) E X [\\X(N 3 )\\] < (||x|| VI) -eiiV 2 forallxeS. 

Inequality (1.7) states that, for large X(-) has an average negative 
drift over [0, iV 3 ] that is at least of order 1 /N. This rate will be a consequence 
of the application of N in the construction of the norm \x\l that appears in 
(1.6). 

The reader will recognize (1.7) as a version of Foster's criterion. It will 
imply the positive Harris recurrence of X(-), provided that the states in S 
communicate with one another in an appropriate sense. Petite sets are typ- 
ically employed for this purpose; they will be defined in Section 2. A petite 
set A has the property that each measurable set B is "equally accessible" 
from all points in A with respect to a given measure. 
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Theorem 1.2. Suppose that a subcritical network with a weighted max- 
min fair policy has interarrival times with finite means and service times with 
2 + 5\ moments, Si > 0. Also, suppose that Al = {x : \\x\\ < L} is petite for 
each L > 0, for the norm in (1.6). Then, X() is positive Harris recurrent. 

Theorem 1.2 will follow from Theorem 1.1 by standard reasoning. More 
detail is given in Section 2. 

A standard criterion that ensures the above sets Al are petite is given by 
the following two conditions on the interarrival times. The first condition is 
that the distribution of £ r (2) is unbounded for all r, that is, 



The second condition is that, for some l r E Z + , the (l r — l)-fold convolution 
of £ r (2) and Lebesque measure are not mutually singular. That is, for some 
nonnegative q r () with f °° q r (s) ds > 0, 



for all c<d. When the interarrival times are exponentially distributed, both 
(1.8) and (1.9) are immediate. More detail is given in Section 2. 
We therefore have the following corollary of Theorem 1.2. 

Corollary 1.1. Suppose that a subcritical network with a weighted 
max-min fair policy has interarrival times with finite means that satisfy 
(1.8) and (1.9), and service times with 2 + 8% moments, 5\ > 0. Then, X(-) 
is positive Harris recurrent. 

Outline of the paper and main ideas. In Section 2, we will provide a brief 
background of Markov processes and will summarize the construction of the 
space S and Markov process X() described above. We will also provide 
background that will be employed to derive Theorem 1.2 from Theorem 
1.1 and to obtain Corollary 1.1. The machinery for this is standard in the 
context of queueing networks; we explain there the needed modifications. 

The remainder of the paper is devoted to the demonstration of Theorem 
1.1. (One minor result, Proposition 3.1, is needed for Theorem 1.2.) In Sec- 
tion 3, we will specify the norms | • |l, | • |r and | • \a that define || • || in (1.6). 
Employing bounds on these three norms that will be derived in Sections 4, 
5 and 10, we obtain the conclusion (1.7) of Theorem 1.1. 

For large ||x||, at least one of the norms \x\v, with V equal to L,R or 
A, must also be large. When \x\y is large for given V, it will follow that 
£^[1 X(./V 3 )|y] — \x\v is sufficiently negative so that (1.7) will hold. 

The analysis for \ ■ \ a is straightforward and is done in Section 4. The 
behavior of -E a: [|X(A r3 )|/j] — \x\r is analyzed in Section 5. The remaining 



(1.8) 



P(&(2) >s)>0 



for all s. 



(1.9) 
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five sections are devoted to analyzing E^flA^iV 3 )^] — \x\l- In the last two 
cases, one needs to reason that, in an appropriate sense, the decrease in 
residual service times of existing documents more than compensates for the 
increase due to arriving documents, thus producing a net negative drift. 

For such an analysis, it makes sense to decompose the process X(-) into 
processes X(-) and X (■), with 

X(t) = X{t) + X A (t) for all t. 

The process X(t) is obtained from X(t) by retaining only those documents, 
the original documents, that were initially in the network, and X A (t) consists 
of the remaining documents. Neither X(-) nor X A (t) is Markov. One defines 
Z(t,B) and Z A (t,B) analogously to Z(t,B). 

Because of the WMMF policy, all documents that remain on a route r, 
over the time interval [0,i], receive the same service A r (i), with A r (<) = 
J" X r (t')dt'. Consequently, 

(1.10) Z r (t,B) = z r (B + A r (t)) for t >0,r eK, B CR+ . 

The norms \ ■ \l and \ ■ \r will be defined so that documents with greater 
residual service times contribute more heavily to the norms. On account of 
(1.10), |X(t)|i and will therefore decrease over time; one can also 

obtain upper bounds on \X A (t)\ L and \X A (t)\ R . One can use this to obtain 
a negative net drift on E X [\X(N 3 )\ L ] - \x\l and E X [\X(N 3 )\ R ] - \x\r, as 
mentioned earlier. 

Only limited use of inequalities arising from (1.10) is needed in Section 5 
for | • \fj. More detailed versions are needed for | • |l, which are presented in 
the first part of Section 6. 

In Section 6, we also introduce the sets A(t), along which we will be able 
to obtain good pathwise upper bounds on We show in Section 6, 

by using elementary large deviation estimates, that the probabilities of the 
complements A(t) c are small enough so that 

E x [\X(N 3 )\ L -\x\ L] A(N 3 ) c ] 

is negligible with respect to £^[|X(iV 3 )|£] — \x\l- 

Sections 7-10 analyze the behavior of \X(N 3 )\i on A(N 3 ). Section 7 
considers the contribution to |X(iV 3 )|£ of residual times s > Nff r ', Nh t was 
mentioned parenthetically after (1.6) and satisfies Nh t < N. Sections 8 and 9 
consider the contribution to |X(./V 3 )|£ of residual times s < Nn r . In Section 
8, this is done for A r (iV 3 ) > 1/6 3 , for given r, with the constant b introduced 
in (3.3). Here, service of individual documents is intense enough to provide 
straightforward upper bounds for \X(N s )\l — \x\l- 

Section 9 considers the case with A r (iV 3 ) < 1/6 3 . This is the only place in 
the paper where the subcriticality of the network is employed; estimation for 
|A^(A^ 3 )|i must therefore be more precise. The short Section 10 combines the 
results of Sections 6-9 to give the desired bounds on £y|X(iV 3 )|£] — \x\l- 
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Notation. For the reader's convenience, we list here some of the notation 
in the paper, part of which has already been employed. We set H r (s) = 1 — 
H r (s); quantities such as H*(s) and are defined analogously in terms 

of H*(s) and 3>*(s), which will be introduced later on. The term x indicates a 
state in S and the corresponding term X(t) indicates a random state at time 
t; z(-) and Z(t,-), and u and U(t) play analogous roles. We will abbreviate 
A r = A r (iV 3 ) and set i r (s) = s + A r ; i r (s) is the initial residual service time 
of an original document that has residual service time s at time iV 3 . We 
employ C\, C2, ■ ■ ■ and e\, £2, ■ ■ ■ for different positive constants that appear 
in our bounds, whose precise values are unimportant. The symbols Z + and 
IR + denote the positive integers and positive real numbers, and = Z + U 
{0}; \y\ and \y\ denote the integer part of y S ffi + and the smallest integer 
n with n > y; and c V d and c A d denote the greater and smaller value 
of c,d € R. The acronyms LHS and RHS will stand for "left-hand side" 
and "right-hand side" when referring to equations or inequalities. Since the 
paper is devoted to demonstrating Theorems 1.1 and 1.2, we will implicitly 
assume that the network under consideration has a WMMF policy, except 
when stated otherwise, and that the moment conditions on the interarrival 
and service times given in Theorem 1.1 hold. We assume the network is 
subcritical only when explicitly stated. 

2. Markov process background. In this section, we provide a more de- 
tailed description of the construction of the Markov process X(-) that un- 
derlies a WMMF network. We then show how Theorem 1.2 and its corollary 
follow from Theorem 1.1. Analogs of this material for queueing networks 
are given in Bramson [1]. Because of the similarity of the two settings, we 
present a summary here and refer the reader to [1] for additional detail. 

Construction of the Markov process. As in (1.5), we define the state 
space S to be the set of pairs x = (z(-),u), where z(-) = (z r (-)) and z r (-) is 
a counting measure that maps B C R + to and u = (u r ), r £ 1Z, has 

positive components. Here, z(-) corresponds to the residual service times of 
documents and u to the residual interarrival times. (One could, as in (4.1) of 
[1], distinguish documents based on their "age," which is not needed here.) 

For the purpose of constructing a metric d(-,-) on S, we assign to each 
document the pair (n, Sj), i = 1, 2, . . . , where rj G {1, . . . , \1Z\} denotes its 
route and Sj > its residual service time. Documents are ordered so that 
s\ < S2 < ■ • ■ , with the decision for ties being made based on a given ordering 
of the routes. When i exceeds the number of documents belonging to x, we 
assign the value (rj,Sj) = (0,0). For x,x' G S, with the coordinates labeled 
correspondingly, we set 



00 



(2.1) 



d(x,x') = J^flfj 



rl\ + \ Si - 41)^1) + J2 



1=1 



r 
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One can check that d(-,-) is separable and locally compact. (See page 82 
of [1] for details.) We equip S with the standard Borel cr-algebra inherited 
from d(-, •), which we denote by 5? . In Proposition 3.1, we will show | • |l, 
| • \r and | • \a are continuous in d(-, •). 

The Markov process X(t) = (Z(t,-),U(t)) underlying the network, with 
Z(t, •) and U(t) taking values z(-) and u as above, is defined to be the right 
continuous process whose evolution is determined by the assigned WMMF 
policy. Documents are allocated service according to the rates A r (-), which 
are constant in between arrivals and departures of documents on routes. 
Upon an arrival or departure, rates are re-assigned according to the policy. 
We note that this procedure is not policy specific, and also applies to a-fair 
policies. By modifying the state space descriptor to contain more informa- 
tion, one could also include more general networks. 

Along the lines of page 85 of [1], a filtration (Ft)-, t G [0, oo], can be as- 
signed to X(-) so that X(-) is Borel right and, in particular, is strong Markov. 
The processes X(-) fall into the class of piecewise-deterministic Markov pro- 
cesses, for which the reader is referred to Davis [4] for more detail. 

Recurrence. The Markov process X(-) is said to be Harris recurrent if, 
for some nontrivial cr-finite measure <p, 



where t]b = f£°l{X(t) £B}dt. If X(-) is Harris recurrent, it possesses a 
stationary measure tt that is unique up to a constant multiple. When tt is 
finite, X(-) is said to be positive Harris recurrent. 

A practical condition for determining positive Harris recurrence can be 
given by using petite sets. A nonempty set A S is said to be petite if for 
some fixed probability measure a on (0, oo) and some nontrivial measure v 



for all x £ A and B £ 5? . Here, P t (-,-), t>0, is the semigroup associated 
with X(-). As mentioned in the Introduction, a petite set A has the property 
that each set B is "equally accessible" from all points x G A with respect to 
the measure v. Note that any nonempty measurable subset of a petite set 
is also petite. 

For given 5 > 0, set 



<f(B) > implies P x (rjB = oo) = 1 



for all x 6 S, 



on (S, S") 




T B (5) = M{t>5:X(t)£B} 



and tb = t#(0). Then, tb{$) is a stopping time. Employing petite sets and 
t b (S), one has the following characterization of Harris recurrence and posi- 
tive Harris recurrence. (The Markov process and state space need to satisfy 
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minimal regularity conditions, as on page 86 of [1].) The criteria are from 
Meyn and Tweedie [10]; discrete time analogs of the different parts of the 
proposition have long been known; see, for instance, Nummelin [11] and 
Orey [12]. 

Theorem 2.1. (a) A Markov process X(-) is Harris recurrent if and 
only if there exists a closed petite set A with 

(2.2) P x (t a <oo) = 1 forallxeS. 

(b) Suppose the Markov process X(-) is Harris recurrent. Then, X(-) is 
positive Harris recurrent if and only if there exists a closed petite set A such 
that for some 5 > 0, 

(2.3) su V E x [t a (5)]<cx. 

One can apply Theorem 2.1, together with a stopping time argument, to 
show the following version of Foster's criterion. It is contained in Proposition 
4.5 in [1]. 

Proposition 2.1. Suppose that X(-) is a Markov process, with norm 
|| • ||, such that for some e > 0, L > and M > 0, 

(2.4) E x [\\X(M)\\]<(\\x\\V L)-e for all x. 
Then, forO<5<M, 

M 

(2.5) E x [t Al {5)\<— (M VL) for all x, 

where Ai = {x: \\x\\ < L}. In particular, if Al is closed petite, then X(-) is 
positive Harris recurrent. 

Theorem 1.2 and its corollary. Proposition 2.1 and Theorem 1.1 provide 
the main tools for demonstrating Theorem 1.2. We also require Proposition 
3.1, which states that the norm || • || in (1.6) is continuous in the metric d(-, •), 
and hence that Al = {x : ||x|| < L} is closed for each L. Together, they give 
a quick proof of the theorem. 

Proof of Theorem 1.2. From the conclusion (1.7) in Theorem 1.1, 
we know that the assumption (2.4) in Proposition 2.1 is satisfied for some L, 
with M = iV 3 and e = E\N 2 . In Theorem 1.2, it is assumed that Al is petite; 
by Proposition 3.1, we also know it is closed. So, all of the assumptions in 
Proposition 2.1 are satisfied, and hence X(-) is positive Harris recurrent. □ 

Corollary 1.1 follows immediately from Theorem 1.2 and the assertion, 
before the statement of the corollary, that the sets Al are petite under the 
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assumptions (1.8) and (1.9). A somewhat stronger version of the analogous 
assertion for queueing networks is demonstrated in Proposition 4.7 of [1]. 
(The proposition states that the sets A are uniformly small.) The reasoning 
is the same in both cases and does not involve the policy of the network. The 
argument, in essence, requires that one wait long enough for the network to 
have at least a given positive probability of being empty; this time t does not 
depend on x for ||x|| < L. One uses (1.8) for this. By using (1.9), one can also 
show that the joint distribution function of the residual interarrival times has 
an absolutely continuous component at this time, whose density is bounded 
away from 0. It will follow that the set Al is petite with respect to v, with 
a chosen as the point mass at t, if u is concentrated on the empty states, 
where it is a small enough multiple of 1 1Z | -dimensional Lebesque measure 
restricted to a small cube. 

3. Summary of the proof of Theorem 1.1. As mentioned in Section 1, 
the norm || • || in Theorem 1.1 consists of three components, with 

(3.1) = \x\l + \x\r+ \x\a 

for each x. After introducing these components, we will state the bounds 
associated with each of them that we will need, leaving their proofs to the re- 
maining sections. We then show how Theorem 1.1 follows from these bounds. 

Definition of norms. We first define \x\l- This requires a fair amount of 
notation, which we will introduce shortly. We begin by expressing \x\l in 
terms of this notation; when the notation is then specified, we motivate it 
by referring back to \x\l- 

We set \x\l = sup r s \x\ r>s for r G 1Z and s > 0, where 



We need to define the terms H*(-), z*(-), T(-), a and sjy- 

Starting with H*{-) and z*,(-), we recall the distribution functions H r (-) 
and counting measure z r (-) from Section 1. In (3.2), we will require their 
analogs H*(-) and z*(-) to have densities with bounded first derivatives and 
to be "close" to H r (-) and z r {-). For this, we define H*(-) and z*(-) as the 
convolutions of H r (-) and of z r (-) by an appropriate distribution function 
<&(•) with density </>(•). Setting 



(3.2) 



w r (l + cisn)z*(s) 
iy r T(H*(s N )) 



(3.3) 




for s > 1/6, 
for s G (0,1/6], 
for s < 0, 
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for b£ Z+ with b > 2, <£(•) is the density of $(s) = f*^ 4>(s') ds' . We note 
that <£(•) has mean at most 2/6 and that </>(•) satisfies 



(3.4) 0'(s)<6 2 and <f){s + s') / <j){s) > e~ bs 

for s,s' > 0. The above properties and the exponential tail of cp(-) will be 
useful later when analyzing | • \i and | • \r [as in (3.7), (5.28), (6.39), (6.40) 
and (9.25)]. 

Convoluting by $(•), we set 

/•oo 

(3-5) J ° 

poo 

z r *((0, S ]) = (z r *$)((0,s]) = / <S>(s-s')dz r ((0,s'}) 



with z*(B) being defined analogously for B C R+. Differentiating both quan- 
tities in (3.5), we also set 

/>oo 

h* r (s) = (H r *$)'(s)= / (j)(s- s')dH(s'), 
(3-6) J ° 

<(s) = (z r **)'((0,s])= / <t>(s-s')dz r ((0,s'}). 



Convolution by $(•), as in (3.5), produces a measure z*(-) that approximates 
z r (-) and possesses a density. 

Since H r (-) is assumed to have a finite (2 + di)th moment for all r, the 
same is true for H*(-). This implies that for appropriate C\ > 1, 



(3.7) F r * (s) < ^ ; ^ 2+gi for all s > and r GK, 



for <5i chosen as in Theorem 1.1. We assume wlog that 6± < 1. Since the 
difference of the means of H*(-) and -ff r ( - ) is at most 2/6 for each r and 
if(-) is subcritical, H*(-) will also be subcritical for large enough b. 
We set 

(3.8) F(a) = a + C 2 aa"< for a €[0,1]. 

We choose 7 S (0,<5i/24], C2 > 2C1/7 an d a small enough so that a < 
(I/C2) A 1 and (9.4) is satisfied. One can think of T(-) as being almost 
linear for values of a that are not too small; the power 7 needs to be small 
in order to be able to bound \x\ r>s later on for H*(sn) small; 7 > is needed 
so that the integral in (9.7) is finite. 

We set sn = s A (N Hr + 1) for TV £ Z+, where 

(3.9) N Hr = (H:)~\l/N A )AN. 
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It follows that 

(3.10) 1/N 4 <HZ(N Hr )<Ci/N 2+Sl . 

If H*(-) has a relatively fat tail, say H*(s) ~ s~ 3 , (3.9) implies that Nu r = N; 
otherwise, N Hr < N and H*(N Hr ) = 1/iV 4 . In either case, it will follow from 

(3.10) that F(H*(Nj{ r )) is "large enough" for us to adequately bound |x| rjS . 
We will assume that N € Z + is chosen large enough so N > 1/a and Njj r > 1 
for all r. 

The norm | • \l has been defined with the following motivation. As the pro- 
cess X(-) evolves, documents arrive at each route, are served, and eventually 
depart. In Proposition 9.2, we will show that, under certain assumptions for 
X(t) on t £ [0, iV 3 ], for large enough b, 

(3.11) X w (t)> {l + e 2 )/\x\ L onte[0,iV 3 ], 

for some £2 > 0, because of the subcriticality of H*(-). Reasoning as below 
(1.10), this will imply that individual documents receive enough service so 
that |A"(£)|£ decreases on average over [0, N 3 ]. More specifically, the increase 
in the term T(H*(sn)) in (3.2), after translating sjy according to the ser- 
vice of documents, will compensate for the arrival of new documents. For 
documents with residual service s < Nu r <N + l&tt = 0, the term 1 + as^ 
in (3.2), after translating sjy according to the service of documents, will 
decrease sufficiently over [0,iV 3 ] to produce the term —e\N 2 in (1.7). For 
documents with residual service s > Nn r , we will instead need to employ 
the norm | • which we introduce next. (On (NH r ,Nn r + 1], the intervals 
overlap.) 

The norm | • \r in (3.1) is given by 

/•oo 

(3.12) \x\ R = M 1 J2^N >r N r ( S )z* r (s)ds. 

r JN Hr 

We need to identify the terms kn,t, N t (-) and M\. We set 

(3.13) K N!r = l/T(H:(N Hr )) 
and 



(3.14) 




for s > N, 
for s < N. 



Later on, we will also employ kn = max r Kjv jr .. For the term Mi, we will 
require that 

(3.15) Mi > 8C3 (maxw r /w r >) , 

where C3 is chosen as in (3.30). 
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Since | • \r is given by a weighted sum of the residual service times of the 
different documents, it will be easier to work with than | • \l, which is a 
supremum. For smaller values of s, we required \ ■ \l because of the nature 
of the WMMF policy. Because of the bound on H*(-) in (3.7), the impact of 
large residual service times on the evolution of X(-) will typically be small, 
and so one can employ the "more generous" definition over (-/V# r , oo) given 
in (3.12). 

As we will see in Section 5, we will require the presence of the term N r (s) 
in the integrand in (3.12) to ensure that the integral decreases sufficiently 
rapidly from the service of documents when the integral is large. This will 
rely on N^(s) > 1 on (iV# r ,oo). For s > N, the denominator N in s 2 /N 
is needed so that the expected increase due to arrivals does not dominate 
the term —eiN 2 in (1.7), which was mentioned in the motivation for the 
definition of | • \l- This denominator is not needed for s € [Njj r ,N) because 
(3.9) will guarantee that the integrand is already sufficiently small there. 
The terms ktv,t- are needed when we combine the norms \ ■ \l and | • \r in 
|| • ||, because of the denominator T(-) in | • | riS . 

The norm | • \a in (3.1) is needed for the residual interarrival times. It is 
given by 

(3.16) |xU = — max#(u r ), 

N r 

where 0(y), y > 0, satisfies the following properties. We assume that 6{y) > 
for all y and that #(•) and #'(•) are strictly increasing, with 

(3.17) 6'(y) -too as y -too. 
We also assume that 

(3.18) 9{y) < y 2 for all y, 
and that #(•) grows sufficiently slowly so that 

(3.19) E[0(£ r )]<oo for all r. 

Since E[£ r ] < oo, it is possible to specify such #(•) that also satisfy the 
previous two displays. 

The above properties for #(•) will enable us to show that the expected 
value of will decrease over time when |X(t)|,4 is large. In particular, 

because of (3.17) and (3.19), the decrease in | • \a due to decreasing residual 
interarrival times will, on the average, dominate the increase in | • |a due 
to new interarrival times that occur when a document joins a route. The 
argument for this is given in Section 4 and is fairly quick. We note that 
when £ r are all exponentially distributed, the term | • 1^ may be omitted in 
the definition of || • II. 
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The reader attempting to understand the norm || • || should first concen- 
trate on | • \l, which was chosen to accommodate the WMMF policy. When 
the service distributions H r () all have compact support and the interarrival 
times are exponentially distributed, one may, in fact, set ||x|| = \x\l for a 
large enough choice of N. 

We note that the norm | • |l is not appropriate for weighted a-fair policies. 
In particular, the supremum and the function T(-) in its definition are not 
appropriate factors in this context. On the other hand, | • \r, with suitable 
Mi, and | • | ^4 should still be applicable to a-fair policies, provided a suitable 
replacement of | • \l can be found. 

In order to apply Proposition 2.1 in the proof of Theorem 1.2 in Section 
2, we needed to know that the sets Al = {x: \\x\\ < L} are closed. For this, 
it suffices to show the norm || • || is continuous in the metric d(-,-) that is 
given in (2.1). 

Proposition 3.1. The norm || ■ || in (3.1) is continuous in the metric 
d(, •) given by (2.1). 

Proof. It suffices to show | • |l, | • \r and \ ■ \ a are each continuous in 
d(-,-). For | • \l, note that the coefficients of z*(s) in (3.2) are bounded. On 
the other hand, if d(x, x') < e < 1, then one can show, by using the first part 
of (3.4), that 

(3.20) \z*(s) - 4'* (a) | < b 2 e for all s and r, 

where z' T -* = (z')j!(s). It follows from this and (3.2) that | • \l is in fact Lips- 
chitz in d(, •). 

For \ ■ \r, one can apply both parts of (3.4) to show with a bit of work 
that, if d(x,x') < e < 1 and x has no residual service times greater than M, 
for given M, then 

(3.21) / N r (s)\z* r (s)-z';*(s)\ds<(M + l) 2 b 2 e + (l-e- be )\x\ R 

JN Hr 

for all r. Since the coefficients of f£f N r (s)z*(s) ds in \x\r are bounded and 
the RHS of (3.21) goes to as e — > 0, the continuity of | • \r follows. 

Since 8'(u r ) is bounded for bounded values of u r , \ ■ \a is also continuous. 

□ 

In addition to the norms in (3.1), we will employ the following norms in 
showing Theorem 1.1: 

(3.22) \x\ =^z r (M + ) = ^z;(M + ) 

r r 
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and 

(3.23) \x\ K = ^K N , r z*((N Hr ,oo)). 

T 

Although we will not employ them in this section, we also introduce the 
norms 

(3.24) 1^ = J> r *((0,iVtf r ]), \x\ 2 = J24((N Hr ,oo)) 

r r 

and 

(3.25) \x\s = Ml + max— z*((N Hr , oo)). 

r p r 

It obviously follows from (3.22) and (3.24) that |x| = + \x\2- The norm 
| • \s will be employed in Proposition 9.2 to derive the bound given in (3.11). 

Bounds on \-\l,\'\r an & I ' U- In order to derive (1.7), we need bounds 
on | • \l, I • \r and | • \a as the process X(t) evolves from t = to t = iV 3 . 
We first need to specify the term L appearing in (1.7). We choose l\ large 
enough so that 

(3.26) e'{h/2) > M\N 
and, for all r, 

(3.27) E[0(Zr);Zr > h/2] < (l/\H\)P^r > ^ 3 )- 
We set 

(3.28) L\ = ie(Zx) 
and 

(3.29) L = G(k 2 n N 17 VLi). 

For | • \l, we employ the bound from Proposition 10.2 that, for large 
enough N and b, small enough a, and appropriate C3 and £3 > 0, 

^[|X(iV 3 )| L ]-|x| L 

(3.30) 

< C 3 N 3 ■ l{\x\ < N 6 } + [C 3 (|x^/|x|)iV 3 - e 3 N 2 ] ■ l{\x\ > N 6 } 

for all x. The precise value of £3 is not important; in Proposition 10.2, it is 
given by \ min r w r . We assume wlog that £3 < C3 . 

For I • \r, we employ the bound from Proposition 5.1 that, for given £4 > 0, 
large enough N, and M2 = |(1 AmmiCi)(mm r r i(w r /w r i))Mi > C3, 

E x [\X(N 3 )\ R ]-\x\ R 
(3.31) < e^ 2 - M 2 (|rE|^/|x|)iV 3 • l{|x| > iV 6 } 

- k n N a ■ l{\x\ R > k 2 n N 17 , \x\ < N 6 } 
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for all x. We will later choose £4 small with respect to £3; the constant C3 
is chosen as in (3.15) and (3.30). 

For I • \a, we will show in Proposition 4.1 and Proposition 4.2 that, for 
this choice of £4 and large enough N, 

(3.32) E X [\X(N 3 )\ A ] - \x\ A < SiN 2 - M\N 3 ■ l{\x\ A > L/6} 
for all x. 



Derivation of (1.7) from (3.30), (3.31) and (3.32). We now derive (1.7) 
from these three bounds. Adding the RHS of (3.30), (3.31) and (3.32), one 
obtains, for large enough N and b, and small enough a, 



(3.33) E x [\\X(N 3 )\\]-\\x\\<2C 3 N 3 
for all x. We next consider the behavior of the LHS of (3.33) for > L/2, 



where L is given by (3.29). This condition implies that either \x\l > k 2 n N 17 



\x\ r > k 2 n N 17 or \x\a > L/6. 

Suppose first that \x\l > nf^N 17 . We note that if |x| < iV 6 , then 

Ml <C 4 iV 8 

for some constant C4. This bound follows from the definition of \x\l in (3.2), 
together with the bounds z*(s) < I2b 2 \x\ for all s, s N < N, and F(H*(sn)) > 
C5/N, for some C5 > [which follows from (3.10) and 7 < 1/4]. Therefore, if 
\x\l > nffN 17 and iV is large enough so that fsjv > 1, one must have \x\> N & . 
On the other hand, it follows from (3.31) that, on \x\ > iV 6 , 

(3.34) E X [\X(N 3 )\ R ] - \x\ R < £ A N 2 - M 2 (\x\ K /\x\)N 3 . 

Adding the terms corresponding to \x\ > N 6 in (3.30) and (3.32) to this 
implies that, for \x\ > TV 6 , and hence for \x\l > ^N 17 , 

E X [\\X(N 3 )\\] - \\x\\ < (2£4 - £ 3 )N 2 + (C 3 - M 2 )(|x|^/|x|)iV 3 

(3.35) 

< -eiN 2 , 

where the latter inequality follows for £4 < £3/3 and e\ = f £3/3, since M2 > 

c 3 . 

Suppose next that \x\r > k 2 n N 17 and |x| < iV 6 . Adding up the corre- 
sponding terms from (3.30), (3.31) and (3.32) implies that the LHS of (3.35) 
is at most 

(3.36) 2£4N 2 + C3N 3 - k n N a < -eiiV 3 



for large N, which is better than the bound in (3.35). 
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Suppose finally that \x\ A > L/6. We need to consider only the case \x\ < 
N®, since \x\ > iV 6 is covered by (3.35). In this case, it follows from (3.30), 
(3.31) and (3.32) that the LHS of (3.35) is at most 

(3.37) (3C 3 - Mi)iV 3 <-eiiV 3 , 
since Mi > 4C 3 . 

Together, (3.35), (3.36) and (3.37) imply that, for large enough N and b, 
and small enough a, 

(3.38) £U||X(iV 3 )||]-||x||<- e iiV 2 
for all ||x|| > L/2. Since for large N, 

L- L/2>2C 3 N 3 + e 1 N 2 , 
(1.7) follows easily form (3.33) and (3.38). 

4. Upper bounds on E x [\X(N s )\a\- In this section, we will demon- 
strate the inequality (3.32) for the upper bounds on ^[^(iV 3 )!^] — \x\ A . 
In Proposition 4.1, we obtain the first term on the RHS of (3.32); this holds 
for all x. We then obtain a better bound in Proposition 4.2, which is valid 
on \x\ > L/6. Both parts require just standard techniques. 

The first bound employs the following elementary inequality on the resid- 
ual interarrival times at time iV 3 : 

(4.1) \X(N 3 )\ A < \x\ A V i max{0(t r (k)) :reTZ,ke[2, A r (N 3 ) + 1]}. 

Here and in later sections, A r (t) denotes the cumulative number of arrivals 
at the route r by time t; A(t) will denote the corresponding vector. The 
inequality k < A r (t) + 1 implies that the interarrival epoch associated with 
£ r (fc) has already begun by time t. Recall that £ r (1) is the initial residual time 
at route r and £ r (2) , £ r (3) , . . . are i.i.d. random variables, and #(•) satisfies 
(3.16)-(3.19). 

Proposition 4.1. For any e > and large enough N, not depending 
on x, 

(4.2) E x [\X(N 3 )\ A ]-\x\ A <eN 2 . 

Proof. By (3.19), E[0(£ r )] < oo for all r. One can therefore show with 
some estimation that, for each r, 



(4.3) -E x \ max 0(£ r (k)) 



0, 



uniformly in x as t — > oo . For fixed x, (4.3) follows immediately from (4.83) 
of [1]; since A r (t) decreases when £ r (l) increases, this limit is uniform in x. 
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Inequality (4.2) follows immediately from (4.1) and (4.3), with t = N 3 . 

□ 

We proceed to Proposition 4.2. For the proposition, it will be useful to 
decompose |X(i)|yi — \x\ A as 

(4.4) \X(t)\ A -\x\ A = I A (t)-D A (t), 

where I A {t) and D A {t) are the nondecreasing functions corresponding to 
the cumulative increase and decrease of |X(-)| J 4 up to time t. That is, 
-Ta(O) = D A (0) = 0, with I A (t) being the jump process, with 

I A (t) - I A (t-) = \X(t)\ A - \X(t-)\ A 

and D' A (t) being the rate of decrease of |X(i)|^ at other times. We note 
that D A (t) is locally Lipschitz, with D' A {t) defined except at arrivals. In 
particular, since U' r {t) = — 1 except at arrivals, 

(4.5) D A (t) = — max. 0'(U r (t)) almost everywhere. 

N r 

We recall the definitions for li,L% and M\ in (3.26)~(3.28) and (3.15). 

Proposition 4.2. Suppose that \x\ A > L/Q. Then, for large enough N 
not depending on x, 

(4.6) E X [\X(N 3 )\ A ] - \x\ A < 1 - Mi iV 3 < -M X N* /2. 

Proof. We first show that 

(4.7) D A (N 3 ) > MiiV 3 . 

Since \x\ A > L/Q = k 2 n N 17 V L\ and 9(y) < y 2 for all y, one has, for N > 2, 
that max r u r > N 8 V h. So, for all t G [0, A^ 3 ], 

(4.8) max« r — maxU r (t) < N 3 < — maxu r . 

r r 2 T 

Consequently, for all t £ [0,N 3 ], 

1 o 1 

(4.9) m&xU r (t) > -maxu. r > JV V 

r 2 r 2 

Moreover, by (3.26) and (4.5), for max r U r (t) > \h, D' A (t) > Mi almost 
everywhere. Together with (4.9), this implies D' A (t) > Mi almost everywhere 
on [O,^ 3 ], and hence (4.7) holds. 

On account of (4.7), in order to show (4.6), it suffices to show 

(4.10) E X [I A (N 3 )}<1 
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for large N. To obtain (4.10), we first note that, for each r, there cannot 
be more than one interarrival time occurring over (0, iV 3 ] with value greater 
than TV 3 . Moreover, because of (4.9), only interarrival times with value at 
least iV 3 V (Zi/2) can contribute to I A (N 3 ). The expectation of 9(£ r ), for £ r 
conditioned on being greater than iV 3 and restricted to being greater than 
h/2, is 

(4.11) E[e^ r )^ r > h/2]/P(Cr > N 3 ). 

(If £ r is bounded above by A^ 3 , set the ratio equal to 0.) It follows that, for 
any x, 

(4.12) E X [I A (N 3 )} < ±J2 E i e (tr);tir > hm/P^r > N 3 ). 

r 

By (3.27), the RHS of (4.12) is at most 1/N, which implies (4.10). □ 

5. Upper bounds on E x [\X(N 3 )\r]. In this section, we will demon- 
strate the following proposition for the upper bounds on E^fl JT(iV 3 )|#] — 
\x\ R , where | • \ R is the norm introduced in (3.12). 

Proposition 5.1. For given e > 0, large enough N and all x, 
E X [\X(N 3 )\ R ] - \x\r < eN 2 - M 2 (|x|^/|x|)A^ 3 • l{|z| > N 6 } 

(5.1) 

- k n N a ■ l{\x\ R > n 2 N N l \ \x\ < iV 6 }, 
where M2 is specified before (3.31). 

The bound (5.1) implies (3.31), which was employed in Section 3, together 
with bounds on E X [\X(N 3 )\ L ] and E X [\X(N 3 )\ A ], to obtain (1.7) of Theorem 
1.1. The bound on E X [\X(N 3 )\ A ] was derived relatively quickly, whereas 
the bound on E X [\X(N 3 )\ L ] will require substantial estimation and will be 
derived in Sections 6-10. The bound on Kj;[|X(iV 3 )|fl] that is given here will 
require a moderate amount of work. 

In order to show Proposition 5.1, it will be useful to rewrite \X (t)\n — \x\r 

as 

(5.2) \X(t)\ R -\x\ R = I R (t)-D R (t), 

where I A (t) and D A (t) are the nondecreasing functions corresponding to 
the cumulative increase and decrease of up to time t. A similar 

decomposition was used in Section 4 for |X(t)|,4- Here, Ir(0) = D R (0) = 0, 
with I R (t) being the jump process with 

I R (t)-I R (t-) = \X(t)\ R -\X(t-)\ R . 
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One can check that Dr(-) is continuous except when a document departs 
from a route. Its derivative is defined almost everywhere, being defined ex- 
cept at the arrival or departure of a document. Since Dr(-) is nondecreasing, 

DR^-Dnih)^ I* D' R (t)dt for h<t 2 . 
Jti 

It is easy to obtain a suitable upper bound on E x [Ir(N 3 )] ; a suitable lower 
bound on E x [Dr(N 3 )] requires more effort. We therefore first demonstrate 
Proposition 5.2, which analyzes E x [Ir(N 3 )]. 

As in Section 4, A r (t) denotes the cumulative number of arrivals at route 
r by time t. It follows from elementary renewal theory that, for appropriate 
Cq and t > 1 , 

(5.3) E x [A r (t)] < C 6 t for each r 

(see, e.g., [3], page 136). Since large residual interarrival times can only delay 
arrivals, the bound is uniform in x. 

Proposition 5.2. For given e > and large enough N , 

(5.4) E x [I R (N 3 )]<eN 2 for all x. 

Proof. It follows from (3.12) that the expected increase in Ir(-), due 
to a document that arrives at route r, is 



M\ KN,r / N r (s)h*(s)ds. 
JN Hr 

Since the number of arriving documents by time iV 3 and their initial service 
times are independent, it follows that 



(5.5) E X [I R (N 3 )] = [M 1 K N , r [ 

V J is 



oo 

N r (s)h* r (s)ds)E x [A r (N 3 )}. 



In order to bound the first term on the RHS of (5.5), we decompose the 
integral there into f N + J™ . When N > Njj r , one has, by (3.9) and (3.13), 



N i r N 



K Ns r N r (s)h* r (s)ds= / sh*(s)ds 

J Nfj 1 IV-W I JN Hr 

(5-6) 

This is, for large enough N, at most 1/N 2 , because of the small power 7 in 
the definition of T(-). Also, 



00 /»oo 

r V 
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1 f 

(5-7) < — : jr — - — tt~ I S 2+Sl/2 h* r (s)ds 



A? 



ivi+5i/2r(i/iv 4 ) 

for appropriate CV, with the last inequality holding because of (3.7). Since 
7 < <5i/24, this is, for large A, at most l/A 1+5l//4 . Together, the bounds for 
the two integrals imply that, for large enough A, 



00 



(5.8) M lKN , r N r (s)h;{s)ds<2/N 1+Sl / 4 . 

Jn Ht 

Application of (5.8) and (5.3) to (5.5), with t = A 3 in (5.3), implies (5.4). 

□ 

We now derive a lower bound on E X [D R (N' 3 )]. As in (5.1), we need to 
consider two separate cases, depending on whether \x\ > A 6 or both \x\r > 
n 2 N N 17 and \x\ < A 6 hold. In both cases, we will employ the following lemma. 
Recall that M2 = ^CgMi, with Cg = (1 A mhn ci)(min r y (w r / w r >)) . 

Lemma 5.1. (a) For all t, 

(5.9) 0k(*)>Mi(|x|*:-|*(<)I*-)- 
(b) For almost all t, 

(5.10) 



"°° ' s 



>N Hr 

>8M 2 \X(t)\ K /\X(t)\. 



v vi)z* r (t, s )ds 



Proof. We first show (a). Recall that A(-) is the stochastic process 
constructed from X(-) in Section 1, where service of documents is pathwise 
identical to A(-), but where the arrival of documents is suppressed. One can 
check that, for all t and uj, 

(5.11) \X(t)\ K <\X{t)\ K 
and 

(5.12) D R (t)>\x\ R -\X(t)\ R . 

Inequality (5.11) follows immediately from Z*(t,B) < Z*(t,B) for all B C 
E + . For (5.12), note that the LHS gives the cumulative decrease of |A(-)|r 
over [0,t] due to the service of all documents, whereas the RHS gives the 
decrease due to service of only the original documents while ignoring the 
decrease due to service of new documents. 
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On account of (5.11) and (5.12), to show (5.9) it suffices to show 

(5.13) \x\ R - \X(t)\ R > M x {\x\ K - \X(t)\ K ). 

Substituting in the definition of | • |_r given by (3.12) and integrating by parts 
on the LHS of (5.13) gives 

M 1 Y,^N, r N r (N Hr )(z;((N Hr ,^)) - 2;(f, (N Hr ,oo))) 

(5.14) 

v ' poo 

+ MiVK7Vr / iV;(s)«((s,oo))-Z;(t,(s,oo)))ds. 
Jn Ht 

It follows from (3.14) and N Hr > 1 that N r (N Hr ) > 1 and that JV^(s) > 1 for 
all s. Consequently, (5.14) is at least 

M 1 J2^NA z *r((N Hr ,oo)) - Z* r (t,(N Hr ,oo))) 

r 

= M 1 (\x\ K -\X(t)\ K ), 

which implies (5.13). 

For (b), we first note that because of the weighted max-min fair protocol 
and (1.1), the rate at which each document is served is at least 

(5.15) ( m / nQ ) ( mi ?(™f/ ffl r)) 

Moreover, the rate of decrease of |X(t)|# per unit service of each document 
on route r is at least 

Mi_K Nr I N' r {s)Z;{t,s)ds>MiK Nr [ ( ^- V 1 )Z*(t,s)ds 

(5.16) 

>M lKN , r z;(jb,(N Hr ,oo)). 

Summing (5.16) over r and multiplying by (5.15) gives each of the bounds 
in (5.10). □ 

We first derive a lower bound on E x [Dr(N 5 )] in the case where |x| > A^ 6 . 

PROPOSITION 5.3. For large enough N and all \x\ > N 6 , 
(5.17) E X [D R (N 3 )} > M 2 {\x\ K /\x\)N\ 

Proof. We restrict our attention to the set 

B 1 ={to:\X(t)\ < \x\+N 6 for all t G [0,iV 3 ]}. 
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By applying Markov's inequality to inequality (5.3) with t = iV 3 , one has 
that, for large N, 

(5-18) P x (^2 A r (N 3 ) > < ^\K\ < \ 

for all x. Consequently, 

(5.19) P(J3i) > 1/2. 

This bound does not depend on \x\. 

We now consider two cases, depending on whether the set 

B 2 = {co:\X(t)\ K > \\x\ K for all t G [0,7V 3 ]} 

occurs. Since \x\ > iV 6 , it follows from the second half of (5.10) that, for all 

te[o,N% 

D' R {t) > 2M 2 \x\ K /\x\ 
on B\ H B 2 ■ Consequently, on B\ n B2 , 

(5.20) D R (N 3 ) > 2M 2 (|x|^/|x|)7V 3 . 
On the other hand, on B\ n -Bfi 

(5.21) \x\k-\X{t)\ k >\\x\ k 
for some (random) rG [0,iV 3 ]. By (5.9), 

D R {t)>M 1 {\x\ K -\X(t)\ K ) 
for all t. Together with (5.21), this implies that 

(5.22) D R (N 3 ) > D r (t) > \Mx\x\k > 2M 2 (\x\ K /\x\)N 6 , 

where \x\ > N e was used in the last inequality. 
Together, (5.20) and (5.22) imply that, on B 1 , 

D R (N 3 ) > 2M 2 (|x| A '/|x|)iV 3 . 

Inequality (5.17) follows from this and (5.19). □ 

We now derive a lower bound on E x [Dji(N 3 )] in the case where \x\r > 
k^N 17 and \x\ < N 6 both hold. We note that, starting from (5.26), the 
argument relies on the discreteness of documents. If one wishes to employ a 
fluid limit based argument rather than the discrete setting employed in this 
paper, different reasoning will be required at this point; it is not obvious 
how one would proceed. 
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Proposition 5.4. For large enough N , 

(5.23) E X [D R (N 3 )] > k n N 4 
for all \x\ R > k 2 n N 17 and \x\ < A 6 . 

Proof. As in the proof of Proposition 5.3, we restrict attention to the 
set B\ defined there. The bound P(Bi) > 1/2 in (5.19) continues to hold 
here. In our present setting, since |x| < iV 6 , wGBi implies that 

\X(t)\<2N 6 for all t G [0, A 3 ]. 

We also consider two cases, depending on whether 

B 3 = {u : \X(t)\ R > \n 2 N N 17 for all t G [0, A^ 3 ]} 

occurs. 

The case is almost immediate. It follows from (5.2) that, for large 
enough and for some r G (0, A 3 ] , 

(5.24) D R (N 3 ) > D r {t) > \x\ R - \X(t)\ r > \k 2 n N 17 > 2k n N 4 
for lo G -Bg. 

The case B% requires some work. We first note that, by the first part of 
(5.10), 



(5.25) 



^w£™,</;gh*;(m)4 



when to G B\ . 

We will truncate the second integral in (5.25) in order to be able to 
introduce an additional factor s into the integrand. We first note that, since 
$(0) = 0, if a document with residual service time at least s is present at 
time t on some route r, then, for large A, 

(5.26) \X(t)\ R > M 1 K N!r s 2 /N > M lS 2 /N. 
Hence, there are no documents with residual service time 

(5.27) sys^dN/M^X^R) 1 / 2 . 

It follows that, for appropriate Co, > 0, (5.25) is at least 
C 8 Mi ^ / f Sl+1 ( s 



2A 6 

r 



(^ vl ) z;(( ' s><is ) 
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(5.28) 



^ CSMI ' m Y, K ^r( N r (s)Z* r (t,s)ds 

2C9M/ y> K ( [°° N r (s)Z* r (t,s)ds] 
N^\X{t)\ 1 ' 2 ^ \JN Hr J 



= 2 -^^\m\T>c*M{>\ N N> 

for all t £ [0, iV 3 ]. The exponential tail of <&(•) is used in the last inequality; 
the equality relies on oj £ B3 . 

Employing the bound on D' R (t) obtained from (5.25) and (5.28), and 
integrating over t € [0, iV 3 ], it follows that, for large N, 

D R (N 3 ) > C 9 Ml /2 K N N 5 > 2k n N 4 

on Bx nB 3 . Together with (5.24), this implies that D R (N 3 ) > 2k n N 4 on 
B 1 . Inequality (5.23) follows from this and P(JBi) > 1/2. □ 



Proposition 5.1 follows immediately from (5.2) and Propositions 5.2, 5.3 
and 5.4. 



6. Upper bounds on E x [\X(N 3 )\l]: Basic layout and bounds on excep- 
tional sets. In this section, we begin our investigation of upper bounds on 
i? x [|X(A^ 3 )|i] — \x\l. Since these bounds will require us to examine a num- 
ber of subcases in Sections 6-9, we will only arrive at the desired bounds in 
Section 10. In the current section, we first state certain elementary inequal- 
ities, mostly involving | • | rjS , that will be useful later on. We then define the 
"good" sets A(-) of realizations of X(-) to which our bounds in Sections 7-9 
will apply. The remainder of the section is spent demonstrating Proposition 
6.1, which gives an upper bound on E x [\X(t)\L — \x\l; A(t) c ], where A(t) c 
is the small exceptional set. 



Elementary inequalities. Here we state a number of elementary inequali- 
ties that will be useful later on. Let £«(•), i = 1,2,3, denote configurations of 
particles on M + , with Zi(B) denoting the number of particles (or documents) 
in B C M + . If one assumes 

(6.1) z 3 (B) = Zl (B) + z 2 {B) forall£CK+, 

it follows that 



(6.2) zt(B) =z*(B) + z* 2 {B) for all B C ]R + , 
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where z*(B) is denned analogously to z*(B) below (3.5), with convolution 
being with respect to </>(•)■ Several elementary equalities follow from (6.2), 
including 

(6.3) |^3|r,s = |^i|r,s + |^2|r,s for all r £ 1Z and s > 0, 

where states in the metric space S introduced in Section 2 for which 

the analog of (6.1) is satisfied for each r and | • | riS is given by (3.2). 

Recall that X(-) and X A (-) are the processes constructed from X(-) that 
were introduced in Section 1, where service of each document is pathwise 
identical to X(-), but where, for X(-), the arrival of documents is suppressed 
and, for X A (-), only new documents are included. One has 

Z(t, B) = Z(t, B) + Z A (t, B) for t > and 

where the processes Z(-),Z(-), and Z A {-) correspond toX(-),X(-) and X A {-). 
From (6.2), 

(6.4) Z*(t,B) = Z*(t,B) + Z A '*(t,B) for t > and B C IR + , 
and from (6.3), 

(6.5) \X(t)\ r!S = \X(t)\ rtS + \X A (t)\ rtS for t >0,r £K,s>0. 
Another elementary equality involving X(-) is given by 

(6.6) Z r (t,B) = z r {B + A r (t)) for t>0,reK,BCR+, 

where, we recall, A r (t) is the translation that gives the amount of service an 
original document that has not yet completed service has received by time 
t. The equality relies on all documents on a given route r receiving equal 
service at each time. [If Z r (t,M + ) = 0, set A r (t) = oo and z r (M + + oo) = 0.] 
From (6.6), one obtains 

(6.7) Z*(t,B) <z* r {B + A r (i)) for t > 0, r G K, B C M+; 

the inequality arises from the possibility that original documents have com- 
pleted service by time t. 

A consequence of (3.2) and (6.7) is that 

(6.8) \X(t)\ r!S <\x\ rtS+Ar{t) for t>0,rG^,s>0. 
Combining (6.5) and (6.8) produces 

(6.9) \X(t)\ r , s <\x\ rtS+Ar{t) + \X A (t)\ r , s for t>0,r£K,s>0; 
taking the supremum over all r and s therefore gives 

(6.10) \X(t)\ L <\x\ L + \X A (t)\ L foralH>0. 
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Application of (6.7) also implies 

(6.11) Z*(t,s)<z*(s + A r (t)) for t>0,r£K,BCR+, 
and application of (6.7), together with (6.4), implies that 

(6.12) \X(t)\ 2 < \x\ 2 + \X A (t)\ 2 fort>0, 

where | ■ \ 2 is given in (3.24). The term on the LHS of (5.16) can also be 
derived using (6.11). 

The sets A(t). In this subsection, we define the random set A(t), which 
is a function of X(t'), for t' G [0,t]. In Sections 7-10, we will establish upper 
bounds on \X(N 3 )\ rtS for u G A(N 3 ); the exceptional small set A(N 3 ) C will 
be treated in the next subsection. The set A(t) will be a "good" set in 
the sense that the number of arrivals over [0,t], for given t, is restricted by 
upper bounds, which will enable us to show that |X(-)|x decreases in an 
appropriate manner. 

The set A{t) is given by A{t) = A\(t) D A 2 (t), with 

(6.13) Ai(t) = f\Ai, r ,j(t) fori = 1,2, 

where Ai t r,j(t) specify upper bounds on the numbers of weighted arrivals 
of documents with different service times. To define Ai >r ,j{t), we denote by 
v o , v\ , . . . , v j the increasing sequence with 

(6.14) v j+1 = v j + l/b 3 for j = 0, . . . , J — 1, 

with v o = and vj = N + 1, and where b is as in (3.3). Note that it follows 
from the second half of (3.4) that, for b > 2, 

(6.15) H;(v j+1 )/H;( Vj ) > 1/2 for all r and j. 

We also denote by S^(k), k = l,...,A r (t), the service time of the feth arrival 
at route r, where A r (t) is the cumulative number of arrivals at r by time t. 
We set, for r G 1Z and j = 0, . . . , J, 

(6.16) Ai >r ,j(t) = I u : Y ®( V J ~ S r( k )) < 2u r (Hr(vj)t Vt") >. 

Here, we assume r\ G (0, 1/12], and, as elsewhere, we set H r (-) = 1 — H r (-) 
and $(•) = 1 — $(•). One has, as a special case of (6.16), that 

(6.17) A r (t)<2v r t on Ai >r , (t). 
Since 

POO 

(6.18) E[$(vj- S r r(fc))]= / s)dH r (s)=H* r {v j ) 

Jo 
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and A r (t) ~ v r t for large t, the probability of the complement Ai t r,j(t)° 
can be bounded above by using standard large derivation estimates. The 
term t v is included on the RHS of (6.16) so that, when H*(vj) is small, the 
probability of the event remains small. 
We also set, for r S 1Z and j = 0, . . . , J, 

f A r (t) 

(6.19) A 2 , r ,j(t) = I u: <P(vj ~ Sr(k)) < (l+£ 5 )MK(vj)tVt V ) 

I k=l 

where £5 > 0. Analogous to (6.18), one has 

/•oo 

(6.20) E[4>(vj-S}{k))}= I <f>(v j -s)dH r (s) = h*(v j ). 

Jo 

The probabilities P x {A2,r,j{t) c ) will satisfy large deviation bounds as well. 
The constant £5 here will later be required to satisfy £5 < £7/4, where £7 is 
specified in (9.1) and measures how subcritical the network is. In (6.16), we 
only need to employ the constant 2, rather than 1 + £5 as in (6.19), because 
(6.16) will be applied to the right tail of H*{-), rather than the "main body" 
of H?(-), as will (6.19). 

Upper bounds on A{t) c . The main result in this last subsection is the 
following proposition. 

Proposition 6.1. For large enough t, 

(6.21) E x [\X(t)\ L - \x\ L ;A(t) c ] < N 3 e~ w 
for all N,x and appropriate Cio > 0. 

Proposition 6.1 gives strong bounds on the growth of |X(t)|^ on A(t) c . 
This behavior is primarily due to the small probability P x (A(t) c ), which is 
given in the next proposition. 

PROPOSITION 6.2. For large enough t, 

(6.22) P x (A(t) c ) <Ne~ Clltv 
for all N,x and appropriate C\\ > 0. 

The interarrival times are assumed to be independent, and large initial 
residual interarrival times only delay future arrivals. The initial state x will 
therefore not affect the bounds in (6.21) and (6.22). Note that only the 
arrival process ^4(-) is relevant for the bounds in (6.22). 

Proposition 6.2 will serve as the main step in demonstrating Proposition 
6.1; it will also be used along with Proposition 6.1 in Section 10. When we 
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apply (6.21) and (6.22) there, we will set t = N 3 and so the factors ./V 3 and 
./V can be absorbed into the corresponding exponentials. We note that Cio 
and C\\ in (6.21) and (6.22), and the bound on t depend on our choices of 
£5 and 6, and on v r and w r . 

In order to show Proposition 6.2, we will employ elementary large devia- 
tion estimates, which are given in the following two lemmas. 

Lemma 6.1. Let W(l), W{2), . . . denote nonnegative i.i.d. random vari- 
ables with mean (3 < 00. Then, for each e > 0, there exists C12 > 0, so that 

(6.23) P \J2 W ( k ) ^ i 1 ~ £ )P™J ^ e~ Cl2n . 

When the support of W(l) is contained in [0, 1] and e £ (0, 1], 



3.24) P I w ( k ) > (1 + £ )@ n ) < e " 



C 13 e 2 /3n 



where C13 > does not depend on the distribution of W(l) or on e. 

Proof. Both (6.23) and (6.24) are elementary large deviation bounds. 
We summarize the argument for (6.24); (6.23) can be shown directly or by 
applying (6.24) after truncating W(k). 

As usual, one employs the moment generating function 

(6.25) i) e {n) = £7[ e e £*=i(W(*)-0] for 6 > 0. 

By expanding the exponential for n = 1, it follows that for appropriate C14 > 
1 and for 6 G (0,1], 

(6.26) ^(1)<1 + CW30 2 , 
and hence 

(6.27) ij) e {n) < (1 + C u p6 2 ) n < e c ^ n . 

By applying Markov's inequality and setting 6 = e/2Ci4, it follows that the 
LHS of (6.24) is at most 

(6.28) e-^ dn ^e{n) < e -^M^ < e -C w e^n 
for C13 = l/4Ci4, as desired. □ 

Let W(l), W(2), . . . denote the successive interarrival times for a renewal 
process (with delay), with A(t) = max{n : Ylk=i W(k) < t} denoting the 
number of renewals by time t. Here, W(2), W(3), . . . are i.i.d., with W(l) 
being the residual interarrival time. We also introduce i.i.d. random vari- 
ables y(l),y(2), . . . , with Y(l) G [0, 1] that are defined on the same space 
as W(k). Set E[W(2)} = p > and E[Y(1)} = m. 
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Lemma 6.2. Let W{1),W(2), . . . and Y(l), Y{2), . . . be as above. Then, 
for given e £ (0, 1] and large t, 

/Mt) \ 
(6.29) P Y ( k ) > (! + e)r l mt < e ~ c ^ mt , 

where C15 > does not depend on the distribution ofY(l). 

Proof. {A(t) > n} is contained in the event {$2k=i W{k) < *}■ Conse- 
quently, by (6.23) of Lemma 6.1, substitution of e/3 for e there implies that, 
for n{t) = \{l-e/2>)~ l p- l t\, 



-Ci&t 



/n{t)+l \ 

(6.30) P(A(t)>n{t))<p[ J2W(k)<t\<e 

V k=2 J 

for appropriate Ciq > and large t (which may depend on e and the distri- 
bution of W). 

We next consider the set where A(t) < n(t). It follows from (6.24) of 
Lemma 6.1 that 

/Mt) \ 
P ^2 Y(k) > (1 + e)f3- 1 mt; A(t) < n(t) 
\k=i J 

(6.31) 

/n{t) \ 

Inequality (6.29) follows from (6.30) and (6.31). □ 

We now employ Lemma 6.2 to prove Proposition 6.2. 

Proof of Proposition 6.2. We first note that since A(t) = Ai(t) n 
A 2 (t), with Ai(t) = f] ren fl/=o Arj (*)> where J = b 3 {N + l) + l< 2b 3 N, it 
suffices to show that for each Ai, r j(t), 

(6.32) Px(Ai, r j(t) c ) < e~ c ^ 

for t > to, for some fixed to and appropriate Cyj > 0. 

We consider the case where i = 1. Denote by W(l), W(2), . . . the inter- 
arrival times of documents on route r and set Y(k) = $(vj — S^(k)). Then, 
Y(k) are i.i.d. random variables and, except for W(l), so are W(k). One has 

(6.33) P = E[W{2)} = v~ l and m = E[Y(1)} = H^Vj) 
with the last equality following from (6.18). 
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We break the problem into two cases, depending on whether or not 
fl?(ty)>t , '" 1 ,in each case applying Lemma 6.2, with e = 1. Under H*(vj) > 
f 1 , one has 



/A r (t) _ \ 

34) P x (Ai,rj(t) e ) = Px ( " 5 r (*0) > 2u r H^( Vj )tj < e" Cl5 



for large t and C15 > as in the lemma, where neither depends on the 
particular value of H*(vj). 

For H*(vj) < , we replace the random variables defined above (6.33) 
by i.i.d. random variables Y'(k) £ (0,1], with Y'(k) > Y(k) and E[Y'(k)] = 
tf 1 . Then, again applying Lemma 6.2, but this time to Y'(k), k = 1, 2, ... , 



(Ar(t) \ 

.35) Px(Al,rj(t) e ) <Px[J2 > 2V ^ ^ e " Cl5 



t'l 



k=l 



as before. Together with (6.34), this implies (6.32) for i ', = 1, with C17 = C15. 

The reasoning for (6.32) when i = 2 is the same, except that one now sets 
Y(k) = 4>{vj — S^(k)), from which one obtains 

(6.36) m = E[Y(l)] = h* r ( Vj ). 

Also, the coefficient 2 on the RHS of (6.16) is replaced by the coefficient 
1 + 65 in (6.19). Setting e = £5 in Lemma 6.2, one obtains 



/Ar(t) \ 

•37) P x f ~ S r(k)) > (1 + e 5 )u r (K(^) V t")J < e" 



Ci 5 t" 



for large £ and appropriate C15 > 0, chosen as in the lemma. Setting Cyj = 
C15, one obtains (6.32) for i = 2 as well. □ 

Setting |vl(i)| = ^ r A r (t), where A r (t) is the number of arrivals at each 
route by time t, it follows from elementary renewal theory that for appro- 
priate C\s and t > 1, 

(6.38) E[\A(t)\ 2 ]<C l8 t 2 

(see, e.g., [3], page 136). Inequality (6.38) is not difficult to show by applying 
a standard truncation argument. 

Here and later on, we will also use the two inequalities 

(6.39) z* r (s) < bz*((s, 00)) for all s > 0, 
and 

(6.40) r(H;.(N Hr + l))/T(H;.(N Hr )) > e~\ 
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which follow from the definition of </>(•) and the second inequality in (3.4). 
Employing Proposition 6.2 and these inequalities, we now demonstrate Propo- 
sition 6.1. 

Proof of Proposition 6.1. By Holder's inequality, 



(6.41) E x [\X{t)\ L - \x\ L ;A{t) c ] < y/P x (A(ty)y/E x [(\X(t)\ L - \x\ L ) 2 ]. 
Also, by Proposition 6.2, one has 

(6.42) V p x(-A(t) c ) < VNe~ Clltv/2 

for all N,x and appropriate Cxi > 0. So it remains to bound the expectation 
on the RHS of (6.41). 

It follows from the definitions of | • \l, <p(-) and T(-), and from (3.10), 
(6.39) and (6.40), that 

w r \ 2be b (l + aN)\x'\ ^ Ar2 , 



(6.43) \x'\ L < (*xp -L j < CWW 

for all x' 6 S and appropriate C±g. So application of (6.10), together with 

(6.43) for x' = X A (t), implies that 

E x [(\X(t)\ L - \x\ L f] < E x [\X A (t)\l) < C 2 19 N'E x [\X A (t)\ 2 ] 
<C 2 19 N 4 E x [\A(t)\ 2 }. 
Together with (6.38), this implies 

(6.44) ^E x [(\X(t)\ L -\x\ L f) < C 20 N 2 t 

for appropriate C20 and large t. 

Substitution of (6.42) and (6.44) into (6.41) implies that for large enough 
t, (6.21) holds, as desired. □ 

7. Upper bounds on |X(iV 3 )| r!S for s > NH r - In Section 6, we ob- 
tained upper bounds on E X [\X(N )\l — \x\l',A(N 3 ) c ]; we still need to ana- 
lyze the behavior of \X(N 3 )\l — \x\l on A(N 3 ). For this, we analyze |X(iV 3 )| rif 
for several cases that depend on whether or not \x\> N e and s > Nn r - 

In this section, we consider the case where \x\> iV 6 and s > Njj r , which is 
the simplest case. The main result here is the following proposition. Recall 
that I a; 1 2 is defined in (3.24). 

Proposition 7.1. For given 83 > 0, large enough N, and \x\ > N 6 and 
\x\ 2 /\x\ < 1/2, 



E x 



sup \X(N 3 )\ rs -\x\ L ;G 

L r,s>N Hr 

(7.1) 

<C 3 (\x\ K /\x\)N 3 + e 3 N 2 (l-P(G) 
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for all measurable sets G, with C3 > not depending on N , G or x. 

In the proof of Proposition 10.1, we will employ Proposition 7.1 by setting 

(7.2) G = A(N 3 )n{co:\X(N 3 )\ L = sup |X(7V 3 )| rjS ). 

r,s>N Hr J 

Much of the work needed to demonstrate Proposition 7.1 is done in the 
following proposition. We recall that i r (s) = s + A r , where A r = A r (iV 3 ). 

Proposition 7.2. For given e > 0, large enough N and all x, 

(7.3) E x \ sup {\X(N 3 )\ riS -\x\ rMs) };G\ <C 21 eN 2 

l r,s>N Hr J 

for all measurable sets G, with C21 not depending on e, N , G or x. 



Proof. We will instead show that 
(7.4) E x \ sup \X A (N 3 )\ r>s 



< C 2 ieN 2 . 



Inequality (7.3) follows immediately from this and inequality (6.9) since 

|^(JV 3 )|r |8 >0. 

To show (7.4), we first note that for all r and s, 

(7.5) \X A (N 3 )\ r>s < C 22 K N ,.N Hr Z^{N 3 ,s) 

for appropriate C 22 , where Z A '* (N 3 , s) == (Z A )*(N 3 ,s). The inequality uses 
(3.2) and (6.40). On s > N Hr , the RHS of (7.5) is at most 

roc 

(7.6) C 22 bK N , r N Hr Z A '*(N 3 ,(s,oc))<C 22 bK N , r / N r (s')Z A '*{N 3 ,s')ds' 

JN Hr 

on account of (6.39) and N r (s) > s. 

On the other hand, by (3.12), the RHS of (7.6) is at most 

(7.7) (Cnb/Mi)\X A (N 3 )\ R < C 21 I R (N 3 ), 

where C 2 \ = f C 22 bjM\ and Ir(-) is as in Section 5. Putting (7.5)-(7.7) to- 
gether, it follows that, for large N, 

(7.8) sup \X A (N 3 )\ r , s <C 21 I R (N 3 ). 

r,s>Nn r 

Also, by Proposition 5.2, for given e, one has that for large enough N, 

(7.9) E X [I R (N 3 )] < eN 2 for all x. 

Taking expectations in (7.8) and applying (7.9) implies (7.4). □ 



NETWORK STABILITY UNDER MAX-MIN FAIR BANDWIDTH SHARING 35 



In order to demonstrate Proposition 7.1, we need Lemma 7.1, which 
bounds \x\k from below in terms of \x\ when (sup rs>N[I \x\ r<s )/\x\L 1S not 
small. For the lemma, we require the inequality 

(7.10) z* r ((0,N Hr ]) < C 23 \x\ L for r e K, 

for appropriate C23. This is a weaker version of (9.8), which we prove in 
Lemma 9.2. [Equation (7.10) does not require any additional assumptions 
on a or b, unlike (9.8).] 

If one supposes that \x\2 < |#|/2, it then follows easily by summing (7.10) 
over r that 

(7.11) M L >c 24 M 

for C24 = l/2C 2 s\TZ\. This inequality will be used in Proposition 7.1 and will 
also be used in Sections 8 and 9. 

Lemma 7.1. Suppose that, for some tq and sq > Nn ro , 

(7.12) \x\ ro ,so > Ml/2. 
Then, for appropriate 8q > not depending on N , 

(7.13) \x\ K >e 6 \x\/N. 

Proof. Applying (7.10), and then substituting (7.12) into (3.2), one 
obtains for given r that 

z* r ((0,N Hr }) < C 25 Nz* ro (s )/nS* ro (N Hro + 1)) 

(7.14) 

<C 25 be b NK N , ro z* ro ((N Hro ,w)) 

for appropriate C25 > 0, where the second inequality employs the assumption 
so > N Hro , together with (6.39) and (6.40). Addition of z*((N Hr , 00)) to both 
sides of (7.14) gives 

<(M + ) < Z ;((N Hr ,oo)) + C 25 be b NK Niro z;. ((N Hro , 00)) 
< (1 + C 25 be b )N KN,r>4>((N Hr , , 00)). 

r' 

Summing over r then implies 

(7.15) \x\ ^e^Nlxlx 

with e 6 = [\TZ\(1 + C 25 be b )]- 1 . □ 

We now apply Proposition 7.2, together with Lemma 7.1 and (7.11), to 
demonstrate Proposition 7.1. 
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Proof of Proposition 7.1. Suppose first that |x| r0jS0 > \x\ L /2 for 
some ro and so > ^H rQ ■ Choosing e > and C21 as in Proposition 7.2, with 
e small enough so e < £3/021 for given £3 > 0, it follows from the proposition 
and Lemma 7.1 that for large N and any G, the LHS of (7.1) is at most 

C 2 ieN 2 < s 3 N 2 < 2e 3 e^(\x\ K /\x\)N 3 - e 3 N 2 

(7.16) 

<C 3 (\x\ K /\x\)N 3 -e 3 N 2 , 

if C 3 is chosen to be at least 2e 3 Eq 1 , where e$ is as in the lemma. This is at 
most the RHS of (7.1). 

Suppose, on the other hand, that \x\ rjS < \x\l/2 for all s > Nn r and r. 
Under \x\ > N 6 and \x\ 2 < \x\/2, it follows from (7.11) that \x\ L > C 2 aN 6 . 
Hence, 

(7.17) sup \x\ r , s -\x\ L <--C 2i N & . 

r,s>N Hr 2 

Since i r (s) > s > Nh t , it follows from Proposition 7.2 and (7.17) that the 
LHS of (7.1) is at most 

C 2 ieN 2 + sup \x\ r , s -\x\ L <C 2 ieN 2 -\c 2 iN 6 P(G) 

r,s>N Hr 2 

(7-18) 

<e 3 N 2 (--P(G)j 

for large N, if we choose e < e 3 /2C 2 i- This is at most the RHS of (7.1), 
which completes the proof. □ 

8. Pathwise upper bounds on |X(AT 3 )| r s for s < Nn r and A r > 1/fo 3 . 

In the previous section, we analyzed the behavior of |X(A r3 )| r . iS — \x\l for s > 
NH r - When s < Nh t , we analyze the cases where A r < 1/6 3 and A r > 1/6 3 
separately. The latter case is quicker and we do it in this section, postponing 
the case A r < 1/6 3 until Section 9. For both cases, we will require certain 
pathwise upper bounds on \X A (N 3 )\ r ^ s that hold on „4i(iV 3 ), which are given 
in Proposition 8.1. We begin the section with these bounds. 

Upper bounds on \X (N^)\ r 8 on „4,i(iV 3 ). In order to derive bounds on 
\X A (N 3 )\ r ^ s , we first require bounds on Z r '*(•, •) that measure how quickly 
documents with the corresponding service times enter a route r up to a given 
time. In Lemma 8.1, we provide uniform bounds on Z^'*(t,s) for t G [0, iV 3 ] 
and oj £Ai(N 3 ). As in previous sections, Sj(k), k = 1, . . . , A r (t), denotes 
the positions of the arrivals of documents up to time t. We also denote here 
by S 2 (t,k) the amount of service such a document has received by time t; 
S}(k) — S 2 (t,k) is therefore the residual service time of the kth document 
at time t. 
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Lemma 8.1. Suppose uj G A\(N 3 ) for some N . Then, for all r, s G 
[0,-ZV + l] andt£ [0,N 3 ], 

(8.1) Z^'*{t,s) <4bv r {H*{s)N 3 ViV 3 "). 
If instead s > N + 1, then 

(8.2) Zf'*(t, s) < 2bu r (H*(N + 1)N 3 V N 3r >). 
Proof. For all r, s G [0, N + 1] and t G [0, N 3 }, 

Ar(t) 

zf'*(t,s)=J2 <K8-Si(k) + $(t,k)) 

, ^ fe = 1 

(8.3) 

A r (N ) A r (7V 3 ) 

< SU P &(s-SXk) + s')<b ${s-S}.(k)) 

k=i »'e[o,oo) k=1 

with the latter inequality employing <j)(s) < b$(s) and the monotonicity of 
<£(•). Letting jo denote the largest j with Vj < s, the last term in (8.3) is at 
most 

(8.4) b Hv j0 -S}(k))<2bi> r (H* r (v j0 )N 3 VN s *) 

k=l 

on .4i(iV 3 ). The inequality in (8.2) follows from this, with jo = J. The in- 
equality in (8.1) follows by applying (6.15) to the RHS of (8.4). □ 

We now derive uniform upper bounds on \X A (t)\ r>s for t G [0,iV 3 ] and 
ijj G Ai(N 3 ). In applications, we will be primarily interested in the behavior 
at t = N 3 . 

Proposition 8.1. Suppose to G „4i(iV 3 ) for some N. Then, for all r 
and s, 

(8.5) \X A {t)\ r:S < C 26 N 3 for t G [0,iV 3 ] and all x, 

for appropriate C 2 q not depending on x,N,ui,r or s. In particular, 

(8.6) \X(t)\ L -\x\ L <C 26 N 3 fort£[0,N 3 ]andallx. 

Proof. By (6.10), 

(8.7) \X(t)\ L -\x\ L <\X A (t)\ L foralH, 
and so (8.6) follows immediately from (8.5). 
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We now investigate |X^(t)|j. )S . From (3.2) and Lemma 8.1, it follows that, 
for tG [0,iV 3 ], 

, _ w r (l + as N )Z A '*(t,s) 
|A [t)ks - v r T(H?(s N )) 

(8.8) < Abw r N 3 (l + as N )H;(s N )/r(H;(s N )) 

+ 4bw r N^(l + as N )/T(H;(s N )). 

We proceed to analyze the two terms on the RHS of (8.8). 
It follows from the definition of T(-) in (3.8) that, for all s, 

(8.9) (1 + as N )B:(s N )/T(B:(s N )) < (1 + as^H^SN)) 1 " 1 /aC 2 . 

Since by assumption, H*(-) has more than two moments and 7 < 1/2, the 
RHS of (8.9) goes to as sn — > 00. Hence, it is bounded for all sn, which 
implies that the first term on the RHS of (8.8) is bounded above by C 2 jN 3 , 
for some C27 not depending on t, r or s. 
On the other hand, for all s, 

(1 + as N )/F(H* r (s N )) < (1 + a(N + l))H* r (N Hr + l)^/aC 2 

(8.10) 

<(l + a(iV + l))(e 6 iVV/C 2 a. 

Since 7 < 1/4, 77 < 1/3 and aN > 1, the latter term on the RHS of (8.8) is 
bounded above by C28-/V 2 , for some C28 not depending on t, r or s. 

The above bounds for the two terms on the RHS of (8.8) sum to (C27 + 
C 2 s)N 3 . Setting C 26 = C 27 + C 28 , this implies (8.5). □ 

Upper bounds on |X(iV 3 )| r)S for s < Nh t and A r > 1/6 3 . Proposition 8.2 
gives an upper bound on \X(N 3 )\ r ^ s — \x\l when s < Nh t and A r > 1/b 3 . 
The proof, which employs Proposition 8.1, is quick. 

Proposition 8.2. Suppose that \x\ > N 6 , with |x| 2 /|x| < 1/2. Then, for 
large enough N , 

(8.11) sup sup |X(iV 3 )| riS - \x\ L < -N 4 for all u eA^N 3 ), 

A r >l/6 3 s<JV Hr 
where N does not depend on x or u. 

Proof. For each r and s, 

\X(N 3 )\ T:S - \x\ L = \X A (N 3 )\ r , s - (\x\ L - |X(iV 3 )| r , s ) 

(8.12) 

<C 26 N 3 -(\x\ L -\X(N 3 )\ r , s ) 

with the last line following from Proposition 8.1. We consider two cases, 
depending on whether |x| r) j r ( s ) > \x\l/2 for given r and s. 
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Suppose first that |x| rj j r ( s ) > |x|l/2, with s < Nh t and \x\2 < \x\/2. One 
has 

(8-13) \Ar M s) ~ \X(N")\ r ,s > ^ • £ ■ ^ff:^ rr - 

u r b 6 T(H*(i r (s) N )) 

To see this, one applies (6.7) to the definition of |x| r . jS in (3.2), noting that 
since s < Nn r , 

(8.14) i r (s) N -s N = i r (s) A (N Hr + 1) - s > A r A 1 > 1/fe 3 , 

and that T(H*(i r (s)N)) < r(.ff*(s)). On account of (3.2) and |x| ri i r ( s ) > 
\x\l/2, one obtains, from the RHS of (8.13), 

a_ ( w r z*(i r (s)) j, \ \ArMfi) | | 

b^{u r r(H*(i r (s) N ))/ lxlrMs) )' \x\ L 

> p • (1 + ai r (s) N y 1 ■ - ■ \x\ L . 

Because of \x\2 < |cc|/2, (7.11), i r {s)N < N, \x\ > N e and aN > 1, this is at 
most C29N 5 , where C29 > does not depend on N, x or to. It follows from 
(8.13) and the succeeding inequalities that 

(8.15) \x\ L - \X(N 3 )\ r , s > \x\ rMs) - \X(N 3 )\ riS > C 29 N 5 . 

Together with (8.12), this gives the RHS of (8.11). 

Suppose, on the other hand, that |x| r j r ( s ) < \x\l/2, with \x\2 < \x\/2. 
Then, by (7.11) and (6.8), the RHS of (8.12) is at most 

C 26 iV 3 - ||x| L - (\x\ rMs) - \X(N 3 )\ r , s ) 

(8.16) 

< C 2(i N 3 - iC 24 A f6 < -N 5 

for large N. This implies (8.11) for |x| ri j r ( s ) < \x\l/2, and hence completes 
the proof. □ 

9. Pathwise upper bounds on |X(AT 3 )| rjS for s < N Hr and A r < 1/b 3 . 

In Sections 7 and 8, we analyzed the behavior of \X(N 3 )\ rjS — \x\l for s > 
N Hr , and for s < N Hr with A r > 1/b 3 . There remains the case s < N}j r with 
A r < 1/b 3 , which is the subject of this section. This is, in essence, the "main 
case" one needs to show in order to establish the stability of the network 
since the other cases dealt with less sensitive behavior and did not employ 
the subcriticality of the system that was given in (1.2). The same was also 
true for the computations of the \ -\a and \ ■ \r norms in Sections 4 and 5. 

Section 9 consists of three subsections. First, in Proposition 9.2, we give 
lower bounds on the minimal service rates X w (-) of documents in terms of the 
norm In the next subsection, we begin our analysis of |A"(A^ 3 )| r . jS for s < 
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N Hr and A r < 1/6 3 . We decompose \X(N 3 )\ r , s — \x\ r ,i r (s) into several parts 
that are easier to analyze. In Proposition 9.3, we then obtain upper bounds 
on the factor Z*(N 3 , s) — z*(i r (s)) of one of the parts. In the third subsection, 
we do a detailed analysis of the decomposition from the previous subsection, 
which also employs the bounds on X w (-) from the first subsection. From 
this, we obtain in Proposition 9.5 the desired bound on |X(A r3 )| T . jS — \x\l. 
We note that, whereas in Section 8, our results pertained to u G „4i(iV 3 ), 
starting from the second subsection here, we require oj € A.2(t)- Our final 
results on \X(N 3 )\ r ^ s — \x\l, for s < Nn r , will therefore be valid on A(N 3 ) = 
A 1 (N 3 )nA 2 (N 3 ). ' 

Lower bounds on X w (-). In order to demonstrate the stability of the 
network, its subcriticality needs to be employed at some point. With this in 
mind, we choose £7 G (0, 1] small enough so that 

(9.1) (1 + e 7 f ArPr < c l for all I, 

which is possible because of (1.2). We henceforth assume £5 < £7/4, where 
£5 was employed in (6.19) in the definition of «4-20- 

The main results in this subsection are Propositions 9.1 and 9.2. Propo- 
sition 9.1 gives a lower bound on X w (t) in terms of |X(t)|,s; Proposition 9.2, 
under additional assumptions, gives the bound in terms of \x\l- 

Proposition 9.1. Assume (9.1) holds for some £7 > 0. Then, for large 
enough b and small enough a, 

(9.2) X w (t)>(l+e 7 )/\X(t)\ s 
for almost all t. 

In this and the previous subsection, we need to employ certain properties 
of r(ff *(•)), which appears in the denominator in (3.2). In Lemma 9.1, we 
state two such properties; the first is employed for Lemma 9.3 and the second 
is employed for Lemma 9.2. Recall that m r is the mean of H r (-). 

Lemma 9.1. For r(-) as defined in (3.8), 

(9.3) T'(H*{s)) > 1 + as for all r and s. 
Moreover, for large enough b and small enough a, 

(9.4) f°° T ( H r(s)) ds < (1 + £7 ) mr 
J 1 + as 

for £7 > satisfying (9.1). 
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Proof. By (3.8) and then (3.7), one has, for all r and s, 
T'(H* r {s)) = 1 + C 2 ia{H* T {s)f- 1 

(9.5) 

> 1 + C 2 C 1 7 ~ 1 7 a(l + s) {l -^ 2+Sl) > 1 + as, 

where the last inequality uses 7 < 1/2 and C 2 > C^ 1 "^^ . This implies (9.3). 
For (9.4), we note from (3.8) and (3.7) that 

(9.6) n T { H *( S » ds< f°° S:(s)ds + Cfa /' 00 (l + s)- 27 (l + as)- 1 ds. 
Jo 1 + as J Jo 

The constant b can be chosen large enough so the first term on the RHS 
of (9.6) is at most (1 + ej/2)m r . Also, by choosing a > small enough, 
since the second term can be chosen as close to as desired, by monotone 
convergence, 

r 00 r 00 1 -i- s 

(9.7) a {l + S y 2 "'{l + asy l ds= (1 + s y^+ 2 ^ ——- ds -> 

Jo Jo V a + s 

as a \ 0. So, for large enough b and small enough a, (9.4) holds. □ 



By employing (9.4), we obtain upper bounds for z*((0, Nn r ]) and z* 
in terms of \x\l and |x|,g. Inequality (9.9) will be crucial for Proposition 9.1. 



Lemma 9.2. For large enough b and small enough a, 

(9.8) z* r ((0,N Hr ]) < (1 + £ 7 K~VMl 
and 

(9.9) z;(R + ) < {I + e 7 )w~ l p T \x\s 
for all N and r, where £7 > is as in (9.1). 



Proof. We note that by (3.2), 

-N Hr 



(9.10) z;((0,N Hr ]) 



z*(s) ds < w r 1 ^ r I x I x, 



N Hr 



1 + as 



ds. 



By (9.4), for large enough b and small enough o, the last term in (9.10) is 
at most 

(9.11) (1 + E'i)w~ 1 i> r m r \x\i J = (1 + s>j)w~ Pt\x\l 

for all N and r, which implies (9.8). It follows from (9.8) and the definition 
of I • |s in (3.25) that 



< (1 + e 7 )w r V 

< (1 + e 7 )w~ 1 p r \x\ s , 



w. 



x\l + —z*((N Hr ,oo)) 

Pr 
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which implies (9.9). □ 



A weaker version of the bound (9.8) was used in (7.10), where the RHS 
of (9.8) was replaced by C23|x|l> and no additional assumptions on b and a 
were required. This follows by noting that the second term on the RHS of 
(9.6) does not depend on a (since a < 1). 

We now demonstrate Proposition 9.1. 



Proof of Proposition 9.1. On account of (9.1), a feasible protocol 

is given by assigning service to each nonempty route r at rate A rj ^ = f (1 + 
si) 2 p r . By (9.9), the rate at which each document is served is 

(9.12) KF= i^e 7 )y (l + e,? P il + e 7 )^ 

Z r (t,R+) Z*{t,R+) ~ \X(t)\ s 

at almost all times t. It follows from this and the definition of the weighted 
max-min fair protocol that 

X w (t) = min ^ r > min > 7 — ~\ f, 7 ^ for almost all t, 
raw w r r&V w r \X(t)\s 

which implies (9.2). □ 

We apply Proposition 9.1 to derive the following lower bound of X r (t) on 
[0, N 3 ]. We note that, by (8.6) of Proposition 8.1 and (7.11), for to G Ai(N 3 ), 
\x\ > N 6 and \x\ 2 < \x\/2, 

(9.13) \X(t)\ L < \x\ L + C 26 N 3 < (1 + e)\x\ L 



holds for given e > and large enough N. In the proposition, we will use 

Yr 



def 
£8 = 



C 2 4 / W r X 



max — e-j 



1 

A 2- 



Proposition 9.2. Suppose that (9.1) holds for some e 7 G (0,1], and 
that \x\ > , with \x\ 2 < esl^l- Then, for large enough N and b, and small 
enough a, 

(9.14) X w (t)>(l + e 7 /2)/\x\ L 
for almost all t £ [0,N 3 ] on to £ ^4i(iV 3 ). 

Proof. It follows from Proposition 9.1 that 

(9.15) X w (t) > (l+e 7 )/\X(t)\ s almost everywhere, 
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for large enough b and small enough a. On the other hand, it follows from 
(3.25), (9.13), (6.12) and (6.17) that, since \x\ > N 6 and \x\ 2 <e 8 \x\, 

\X(t)\s<\X(t)\ L +(max^)z;(t,{N Hr ,oo)) 

V r Pr J 

(9-16) 

< (1 + s)\x\l + (max— J |x| 2 + 2^maxf r ^iV' : 

holds for given e > and large enough N, for all uj £ A±(N 3 ) and t S [0, N 3 ]. 
Applying \x\ > iV 6 , \x\? < es\x\ and (7.11) to the RHS of (9.16) implies that 
it is at most 

^1 + e + \x\l + 2^max^^ (max.v r ^J \x\l/C 2 aN 2 . 
Consequently, for small enough e > 0, 

(9.17) \X{t)\ s < (l + e 7 /4)\x\ L for all t G [0, A^ 3 ]. 

Together with (9.15), this implies (9.14). □ 

Decomposition of \X(N^)\ TiS — \x\ r ^i s \. In this short subsection, we de- 
compose |X(7V 3 )| r)S — |x| rj j r ( s ) into several parts, one of which contains the 
factor Z*(N 3 ,s) — z*(i r (s)). In Proposition 9.3, we then obtain upper bounds 
on this factor. In this and the remaining subsection, the estimates need to 
be more precise than in previous sections in order to make use of the sub- 
criticality of -X"(-)- 

The decomposition that was referred to above is given by 

\X(N 3 )\ r , s -\x\ rMs) 

w r (l + as)(Z*(N :i ,s) - z*(i r (s))) 



(9.18) 



v r T(a r ) 

1 + as T{o~ r ) — T(o~' r ) aw r A r z*(i r (s)) 



1 lrMsy l + ai r (s) T(a r ) v r T{a' r ) 

and holds for s < Njj r and A r < 1/6 3 . It will be employed in Corollary 9.1. 
Here and later on, we abbreviate, setting a r = H*(s) and a' r = H*(i r (s)). 
[One can check that (9.18) holds as given, without employing either sn or 
v(s)iv, as in (3.2), since i r (s) = s + A r < Njj r + 1, and hence sn = s and 
i r (s) N = i r (s).] 

To apply the bound (6.19) onw£ A2{N 3 ) and derive an upper bound on 
Z*(N 3 ,s) — z*(i r (s)), we need to select a Vj from among vq,...,vj, as given 
by (6.14). For this, we denote by v(s) the value Vj with 

(9.19) v j e[i r (s),i r (s) + l/b 3 ). 

Under s < Nn r and A r < 1/6 3 , such a v(s) exists. 
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Proposition 9.3. Suppose uj e A2{N 3 ), for some N and b, with b as 
in (3.3). Then, 

(9.20) Z;{N 3 , s) - z;(i r {s)) < (1 + e 5 )(l + A/b 2 )v r [h* r (v(s))N 3 V iV 3,? ] 

for all r and s with A r < 1/b 3 and s < Nff r! where £5 > is as in (6.19) 
and v(s) is given by (9.19). 

Proof. By (6.7), the LHS of (9.20) is at most Z^'*(N 3 , s). For s < N Hr , 
this equals 

A r (N 3 ) A r (N 3 ) 

^{s-Sl{k) + Sl{N 3 ,k))<e 2 ' b2 Hv(s) ~ S}.(k)) 

k=l k=l 

(9.21) 

A r {N A ) 

<(l + 4/6 2 ) £ 4>{v{s)-Sl{k)). 

k=l 

To see (9.21), we note that since S 2 (N 3 , k) < A r < 1/b 3 , 

(9.22) Vj -Sl(k) G [s - Si(k) + S 2 (N 3 ,k),s - Sl(k) + S 2 (N 3 ,k) + 2/b 3 ]. 

Together with the second half of (3.4), this implies the first inequality. The 
second inequality follows by expanding e 2 l b . Since uj £ A2(N 3 ), the RHS of 
(9.20) then follows by applying (6.19). □ 

In the next subsection, we will also employ the following bound on h*{s2) — 
h*(si) for s\ < S2- 

Proposition 9.4. For any r, si < S2 and b, 

(9.23) h* r (s 2 ) - K(si) < eb 2 (s 2 - si)H r *(si). 

PROOF. Since h* r (s) = <f>(s - s') dH r (s') for each s, the LHS of (9.23) 
equals 

POD 

(9.24) / (^a-aO-^ai-sOWOO- 
J 

By the first part of (3.4) and the definition of </>(•), (p'(s) < b 2 for all s and 
(f>{-) is decreasing on [1/6, 00). So, (9.24) is at most 

/>oo 

/ b 2 (s 2 - ai)l{s' >s 1 - 1/6} dH(s') < b 2 (s 2 - fli)flr(*i - V&) 

JO 

(9.25) <6 2 (s 2 -si)^;(si-l/6) 

<e6 2 (s 2 -si)i?;(si). □ 
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Upper bounds on \X(N s )\ rjS . In this subsection, we employ the previous 
two subsections to obtain upper bounds on |X(iV 3 )| riS — \x\l for u £ A(N 3 ), 
when s < Nn r and A r < 1/6 3 . Our main result is the following proposition. 
As elsewhere in this paper, we are assuming that aN > 1. 

Proposition 9.5. Suppose that (9.1) holds for some e-j £ [0, 1] and that 
\x\> N e , with \x\2 < £s \x\ , where eg is specified below (9.13). Then, for large 
enough N and b, and small enough a, 

(9.26) \X(N 3 )\ rjS -\x\ L <-lw r N 2 

for u € A(N 3 ), and all r and s with A r < 1/b 3 and s < Nh t - 

Our main step in demonstrating Proposition 9.5 will be to demonstrate 
the following proposition. 

Proposition 9.6. Under the same assumptions as in Proposition 9.5, 

w T {l + as)(Z*(N 3 ,s)- z* T {i r {s))) ^ . l + as T(a r )-T(a' r ) 

<C x II ' * ^ 

u r T(a r ) ~ l + ai r (s) r(ov) 

(9.27) 

C30W r N 3 3/2 

ab{l + as) 

for appropriate C30 and C31 not depending on w,N,a,b,r or s. 

In order to demonstrate Proposition 9.6, we note that, on account of 
Proposition 9.3, the LHS of (9.27) is, under the assumptions for the latter 
proposition, at most 

(i r (s)(/i r Xs))iV 3 VN 3ri ) 

(9.28) <d r (s)( mf X(s'))N 3 

+ d r (s)(h* r (v(s))- inf h*Js'))N 3 + d r (s)N 3v , 



where 



dr(s) = (1 +e 5 )(l + 4/6 2 K(l + as)/T(a r 



We will show in Lemmas 9.3, 9.4 and 9.5 that each of the three terms on 
the RHS of (9.28) is bounded above by the corresponding term on the RHS 
of (9.27). Proposition 9.6 then follows. 

We first show Lemma 9.3, which applies to the first term on the RHS of 
(9.28), and should be thought of as the "main term" there. 
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Lemma 9.3. Under the same assumptions as in Proposition 9.5, 
(9.29) d r (s)( inf K(s'))N 3 <\x\ L 1 + aS r ^)" r ^) 



s'e[s,i r (s)] r J l + ai r (s) r(cr r ) 

Proof. It follows from Proposition 9.2 that 

(9.30) X w (t) > (1 +e 7 /2)/\x\ L for almost all t G [0, N 3 ] 
for large enough ./V and b, and small enough a, and therefore 

(9.31) A r > (1 + e 7 /2)w r N 3 /\x\ L for all r. 
Consequently, the LHS of (9.29) is at most 



d r {s)\x\ L { inf h;(s'))A r /w r {l + e 7 /2) 

\s'£[s,i r (s)] / 

(9 32) 

< d r (s)\x\ L (S* r (s) - H* r (i r (s)))/ Wr (l + E 7 /2). 

This last quantity can be rewritten as 

(l + e 5 )(l + 4/b 2 ) 1 + as T(a r )-T(a' r ) 

l + e 7 /2 ' ^ L ' 1 + aiJs) ' T(a r ) 

(9.33) 

1 + ai r (s) 
X (T(a r )-T(a' r ))/(a r -a' r Y 

We proceed to bound the components of (9.33). Since £5 < £7/4, one has 
for large enough b, depending on £7, that 

(9 34) a±^ffl s(1+1/ ^-.. 

1 + £7/2 

Since T(-) is concave and oy > cr^., 

coq^ r(cr r ) - r«) . ^^-.i 

(9.35) : > 1 {a r ) > 1 + as, 

a r - a' r 

with the second inequality holding on account of (9.3). So the last term in 
(9.33) is at most 

(9.36) i±^l = l + -^< 1 + 1/63, 

1 + as 1 + as 

where the inequality uses A r < 1/6 3 . Consequently, (9.33) is, for large b, at 
most 

(i + i/tf )-'(i + i/ 6 = )Wl -L±i^ . r ' g -»-^» , 

1 + ai r {s) 1 (oyj 
which is at most as large as the RHS of (9.29). This implies the lemma. □ 

We next demonstrate Lemma 9.4, which applies to the second term on 
the RHS of (9.28). 
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Lemma 9.4. For all r and s with A r < l/b 3 and s < Nu r , 

(9.37) d r {s)(h*Ms)) - inf K(s'))n 3 < 

V 8'e[«,ir(s)] / ao(l + as) 

for appropriate C30 not depending on w,N,a,b,r or s. 

Proof. Since v(s) — s < 2/b 3 , it follows from Proposition 9.4 that the 
LHS of (9.37) is at most 

(9.38) (l + e 5 )(l + A/b 2 ) 2 ^w r N'H;.( S ) { -^^. 

On account of (3.8), since 7 < #i/4, b > 2 and £5 < 1, this is at most 

^iV3(iJ r *( s ))l-7 (1 + as) < ^l^lAr3 (1 + as) l-(l-7)(2+ 5l ) 

(9.39) 

24CiuviV 3 



C 2 a6(l + as) 



Recall that Ci and C2 do not depend on w, N, a, 6, r or s. The RHS of (9.37) 
follows from this last term by setting C30 = 24Ci /C2 . □ 

We now demonstrate Lemma 9.5, which applies to the third term on the 
RHS of (9.28). 

Lemma 9.5. For all s < N Hr , 
(9.40) d r (s)N' 3v < C-iiWrN^ 2 

for appropriate C31 not depending on w, N,a,b,r or s. 

Proof. Since s < N Hr <N,j< 1/24, rj < 1/12, b > 2 and e 5 < 1, it 
follows from (3.8) and (3.10) that the LHS of (9.40) is at most 

9-41 r -\ J < —- Wr NV\l + aN). 

Since aN > 1, this is at most 8w r N 3 / 2 /C 2 , which gives the RHS of (9.40) 
forC 3 i = 8/C 2 . □ 



Proposition 9.6 follows by applying Lemmas 9.3, 9.4 and 9.5 to (9.28). 
We will apply the following corollary of the proposition to Proposition 
9.5. The corollary combines the inequality (9.27) with (9.18). 
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COROLLARY 9.1. Under the same assumptions as in Propositions 9.5 
and 9.6, 

(9.42) \X(N*)\ r , s -\x\ L < + C 3lWr N^ - <(* (.)) 

ab(l + as) v r Y \a r ) 

for appropriate C30 and C31 not depending on w, N , a, b, r or s. 

Proof. The first term on the RHS of (9.27) of Proposition 9.6 is at 
most 



, > 1 + as r(<r r ) - T(a' r ) 

|X W)l + a i r ( s ) ' T(a r ) +WL WrMs) 



since the coefficients of |x| rj j r ( s ) in the first term in (9.43) are at most 1. 
Substituting (9.43) into (9.27) and then applying the resulting inequality to 
the RHS of (9.18), we note that the term on the LHS of (9.27) is the first 
term on the RHS of (9.18) and the first term in (9.43) is the negative of the 
second term on the RHS of (9.18). After the resulting cancellation, the last 
two terms on the RHS of (9.27), together with the last term on the RHS of 
(9.18), give the RHS of (9.42). □ 

In order to show Proposition 9.5, we will need a lower bound on the last 
term on the RHS of (9.42) and an upper bound on each of the first two terms. 
In the following lemma, we obtain the former. Note that the assumptions 
in the lemma are those of Proposition 9.2, with the additional assumption 
that 

(9.44) \x\ rMs ) > \x\ L /(l + e 7 /2) for some s<N Hr . 



Lemma 9.6. Suppose that (9.1) holds for some e 7 G (0, 1], that \x\ > N 6 
with \x\2 < ffsMi an d that (9.44) ^ s satisfied for a given s. Then, for large 
enough N and b, and small enough a, 

„„ eA( iv=). 

v ; u r F{a' r ) ~ 1 + ai r (s) v ' 

PROOF. By Proposition 9.2, 

(9.46) \ w (t) > (l + e 7 /2)/\x\ L for almost all t G [0,iV 3 ]. 
Consequently, 

(9.47) A r > (1 + e 7 /2)w r N 3 /\x\ L for all r. 
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It follows from (9.47), (3.2) and (9.44) that the LHS of (9.45) is at least 
(g 4g) (l+s 7 /2)w r N 3 z* r (i r (s)) = (l + e 7 /2)N 3 \x\ rMs) > N 3 



v r T{a' r )\x\L (1 + ai r (s))\x\ L l + ai r (s)' □ 

We now apply Corollary 9.1 and Lemma 9.6 to demonstrate Proposition 
9.5. 

Proof of Proposition 9.5. We will consider two cases for a given 
s < Nh t , depending on whether (9.44) holds. Suppose it does. Then, by 
Lemma 9.6, 

^ ^ aw r A r z*(i r (s)) > aw r N 3 



i> r r(cr' r ) l + ai r (s) 

which is a lower bound for the third term on the RHS of (9.42). 

On the other hand, if one chooses b > 8C3o/a 2 , then, since a < 1 and 
A r < 1/b 3 < 1, the first term on the RHS of (9.42) satisfies 

(9.50) C *> W ' N3 < ^ 



ab(l + as) ~ 4(1 +ai r (s))' 

which is 1/4 of the RHS of (9.49). Since s < N Hr , i r (s)<N + l. So, the sum 
of the first and third terms on the RHS of (9.42) is, for large N, at most 

3aiv r N 3 5 - 

(9.51) - — —rrz < — w r N 2 . 

v ; 4(1 + ai r (s)) ~ 8 

The second term on the RHS of (9.42) satisfies 

C 31 w r N 3/2 < lw r N 2 

for large N. Combining this with (9.51), one obtains from Corollary 9.1 that 

\X(N 3 )\ r<s -\x\ L <-± Wr N 2 , 

which implies (9.26) under (9.44). 

When (9.44) fails for s, one has, for large N, 

\X(N 3 )\ r , s - \x\ L = (^(iV 3 )]^ - \x\ rMs) ) - (\x\ L - \x\ rMs) ) 

< \X(N 3 )\ r , s - \x\ rMs) - \e 7 \x\ L 



(9.52) 



<\X A {N 3 )\ r ^-\C 2 iE 7 \x\ 
< C 2e N 3 - \C 2 ±e 7 N & < -N 5 , 



where, in the second inequality, we applied (6.9) and (7.11), and in the third 
inequality, we applied (8.5) of Proposition 8.1 and |x| > iV 6 . This implies 
(9.26) when (9.44) fails. □ 
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10. Conclusion: Upper bounds on ^[^(AT 3 )^]. In the preceding four 
sections, we obtained upper bounds on 

\X(N 3 )\ r , s -\x\ L and E X [\X(N 3 )\ L - \x\ L ;A(N 3 ) c ] 

under various assumptions. In Propositions 6.1 and 6.2 we showed that 
P X (A(N 3 ) C ) and the corresponding expectation E x [\X(N 3 )\l; A(N 3 ) c ] are 
small. In Proposition 7.1, we showed that the expected value of |X(iV 3 )| rjS — 
\x\l is small for s > Nn r - In Sections 8 and 9, we obtained pathwise estimates 
on A(N 3 ) when s < Nu r , depending on whether A r > 1/6 3 or A r < 1/b 3 . 
Proposition 8.2 gives an upper bound in the former subcase and Proposition 
9.5 gives an upper bound in the latter subcase. Except for Propositions 6.1 
and 6.2, we assumed that \x\ > iV 6 ; for the different results, we also required 
various side conditions. 

We tie these results together in Proposition 10.2 to obtain inequality 
(3.30) that was cited earlier. We do this in several steps, first combining the 
results for s < Njj r , then combining these with Proposition 7.1 for s > Nji r , 
and lastly including the bound from Proposition 6.1 on „4(iV 3 ) c . The first 
two steps are done in Proposition 10.1. As elsewhere in the paper, aN > 1 
is assumed. 

Proposition 10.1. Suppose that (9.1) holds for some e-j G (0,1] and 
that \x\ > N 6 , with \x\2 < Ss\x\, where es is specified below (9.13). Then, for 
large enough N and b, and small enough a, 



Note that the assumptions for the first half of Proposition 10.1 are the 
same as for Proposition 9.5, except that the restriction that A r < 1/6 3 has 
been removed. 

Proof of Proposition 10.1. Inequality (9.26) in Proposition 9.5 cov- 
ers the case where A r < 1/6 3 ; (8.11) of Proposition 8.2 covers the case where 
A r > 1/6 3 . Together, they imply (10.1). 

In order to demonstrate (10.2), we partition A(N 3 ) into GUH, with 



(10.1) |X(iV 3 )| r , s - \x\ L <-\w r N 2 



for io G „4,(iV 3 ), all r, and s with s < Nu r . Moreover, for large enough N 
and b, and small enough a, 



^[|X(iV 3 )| L -|x| L ;^(iV 3 )] 

(10.2) 





sup 

r,s>N Hr 
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Applying Proposition 7.1 to this G, with e 3 = (min r uv)/2, and applying 
(10.1) on H, it follows that the LHS of (10.2) equals 



E x 



sup \X{N 3 )\ r , s - \x\ L ;G +E X sup \X{N' 6 )\ r)S - \x\ L ;H 



r,s>N Hr 



r 3 \ 



r,s<N Hr 



(10.3) <C 3 (\x\ K /\x\)N 3 - (^minw r ^N 2 (2P x (G) + 2P x (H)-l) 

= C 3 (\x\ K /\x\)N 3 - (^minw r )jN 2 (2P x (A(N 3 )) - 1). 

This implies (10.2). □ 

We now obtain our desired result, Proposition 10.2, which gives upper 
bounds on E^fl-X^iV 3 )!/,] — \x\l- The first part of the proposition applies to 
all x; the second part requires that |x| > N 6 . 

PROPOSITION 10.2. Suppose that (9.1) holds for some e 7 G (0,1]. 

(a) For large enough N , 

(10.4) E x [\X(N 3 )\ L ]-\x\ L <C 3 N 3 for all x. 

(b) For \x\> N®, large enough N and b, and small enough a, 

(10.5) E X [\X(N 3 )\ L ] - \x\ L <C 3 (\x\ K /\x\)N 3 - Qmin^iV 2 . 

In both parts, C 3 is an appropriate constant that does not depend on x or 
N. 

PROOF. We first show (a). By (6.10) and (8.6) of Proposition 8.1, 

(10.6) \X(N 3 )\ L -\x\ L <C 26 N 3 

for all uj £ A\{N 3 ) and appropriate C26 > not depending on x, N, or u. 
Together with Proposition 6.1, this implies 

E x [\X{N 3 )\ L ]-\x\ L 

(10.7) =E x [\X(N 3 )\ L ;A(N 3 )] + E x [\X(N 3 )\ L ;A(N 3 y]-\x\ L 

< C 2 qN 3 + N 3 e- CwNZl1 < 2C 2& N 3 

for large enough N . For C 3 > 2C 2 q, this implies (10.4). 

For (b), we suppose first that \x\ 2 < esl^l; where e$ is given below (9.13). 
Then, (10.2) of Proposition 10.1, together with Propositions 6.1 and 6.2, 
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- \ X \L 



< C 3 (\x\ K /\x\)N 3 - (^minw r S ]N 2 (2P x (A(N 3 )) - 1) 



(10.8) 



+ N 3 e~ c ™ N3v 
<C 3 (\x\ K /\x\)N 3 - QminuvW 



for large N and b, and small a. This implies (10.5) for \x\2 < ^sM- 

Assume now that \x\2 > £sM- Choosing C 3 > (2C26 + jmin r w r )/es, it 
follows from (10.7) that, for large N, 



This implies (10.5) for \x\2 > £s\ x \- D 
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